Data

From wiki.gis.com
Jump to: navigation, search

Data are pieces of information that represent the qualitative or quantitative attributes of a variable or set of variables. Data (plural of "datum", which is seldom used) are typically the results of measurements and can be the basis of graphs, images, or observations of a set of variables. Data are often viewed as the lowest level of abstraction from which information and knowledge are derived.

Etymology

The word data (pronounced /ˈdeɪtə/, /ˈdætə/, or /ˈdɑ˝tə/) is the Latin plural of datum, neuter past participle of dare, "to give", hence "something given". The past participle of "to give" has been used for millennia, in the sense of a statement accepted at fn, Data). In discussions of problems in geometry, mathematics, engineering, and so on, the terms givens and data are used interchangeably. Such usage is the origin of data as a concept in computer science: data are numbers, words, images, etc., accepted as they stand.

Usage in English

In English, the word datum is still used in the general sense of "something given". In cartography, geography, nuclear magnetic resonance and technical drawing it is often used to refer to a single specific reference datum from which distances to all other data are measured. Any measurement or result is a datum, but data point is more common,[1] albeit tautological. Both datums (see usage in datum article) and the originally Latin plural data are used as the plural of datum in English, but data is more commonly treated as a mass noun and used in the singular, especially in day-to-day usage. For example, "This is all the data from the experiment". This usage is inconsistent with the rules of Latin grammar and traditional English,[citation needed] which would instead suggest "These are all the data from the experiment".

Some British and UN academic, scientific and professional style guides[2] request that authors treat data as a plural noun. Other international organizations, such as the Institute of Electrical and Electronics Engineers Computer Society,[3] allow its usage as either a mass noun or plural based on author preference. The Air Force Flight Test Center, in its publication The Author's Guide to Writing Air Force Flight Test Center Technical Reports specifically states that the word data is always plural, never singular.

Data is now often treated as a singular mass noun in informal usage, but usage in scientific publications shows a divide between the United States and United Kingdom. In the United States the word data is sometimes used in the singular, though scientists and science writers more often maintain the traditional plural usage. Some major newspapers such as The New York Times use it alternately in the singular or plural. In the New York Times the phrases "the survey data are still being analyzed" and "the first year for which data is available" have appeared on the same day. In scientific writing data is often treated as a plural, as in These data do not support the conclusions, but many people now think of data as a singular mass entity like information and use the singular in general usage."[4] British usage now widely accepts treating data as singular in standard English,[5] including everyday newspaper usage[6] at least in non-scientific use.[7] UK scientific publishing still prefers treating it as a plural.[8] Some UK university style guides recommend using data for both singular and plural use[9] and some recommend treating it only as a singular in connection with computers.[10]

'Raw data refers to a collection of numbers, characters, images or other outputs from devices to convert physical quantities into symbols, that are unprocessed. Such data is typically further processed by a human or input into a computer, stored and processed there, or transmitted (output) to another human or computer (possibly through a data cable). Raw data is a relative term; data processing commonly occurs by stages, and the "processed data" from one stage may be considered the "raw data" of the next.

Mechanical computing devices are classified according to the means by which they represent data. An analog computer represents a datum as a voltage, distance, position, or other physical quantity. A digital computer represents a datum as a sequence of symbols drawn from a fixed alphabet. The most common digital computers use a binary alphabet, that is, an alphabet of two characters, typically denoted "0" and "1". More familiar representations, such as numbers or letters, are then constructed from the binary alphabet.

Some special forms of data are distinguished. A computer program is a collection of data, which can be interpreted as instructions. Most computer languages make a distinction between programs and the other data on which programs operate, but in some languages, notably Lisp and similar languages, programs are essentially indistinguishable from other data. It is also useful to distinguish metadata, that is, a description of other data. A similar yet earlier term for metadata is "ancillary data." The prototypical example of metadata is the library catalog, which is a description of the contents of books.

Experimental data refers to data generated within the context of a scientific investigation by observation and recording.

Meaning of data, information and knowledge

The terms information and knowledge are frequently used for overlapping concepts. The main difference is in the level of abstraction being considered. Data is the lowest level of abstraction, information is the next level, and finally, knowledge is the highest level among all three.[citation needed] For example, the height of Mt. Everest is generally considered as "data", a book on Mt. Everest geological characteristics may be considered as "information", and a report containing practical information on the best way to reach Mt. Everest's peak may be considered as "knowledge".

Information as a concept bears a diversity of meanings, from everyday usage to technical settings. Generally speaking, the concept of information is closely related to notions of constraint, communication, control, data, form, instruction, knowledge, meaning, mental stimulus, pattern, perception, and representation.

Beynon-Davies [11] uses the concept of a sign to distinguish between data and information. Data are symbols. Information occurs when symbols are used to refer to something.

According to Mr.Wayne Tomasi, information is knowledge or intelligence whereas, data is a processed, organized and stored information.[citation needed]

See also

References

  1. Writing Reports
  2. World Health Organization Style Guide, p 43
  3. IEEE Computer Society Style Guide
  4. http://www.bartleby.com/61/51/D0035100.html
  5. New Oxford Dictionary of English, 1999
  6. ...in educated everyday usage as represented by The Guardian newspaper, it is nowadays most often used as a singular. [1]
  7. Oxford definition
  8. [2] Data: Published by Blair Wisconsin International University singular or plural?
  9. Singular or plural
  10. http://openlearn.open.ac.uk/mod/resource/view.php?id=182902
  11. Beynon-Davies P. (2002). Information Systems: an introduction to informatics in Organisations. Palgrave, Basingstoke, UK. ISBN 0-333-96390-3

External links

This article was originally based on material from the Free On-line Dictionary of Computing, which is licensed under the GFDL.