Metadata

CA Erwin Data Modeler
Get A Free Trial Of CA ERwin® Data Solutions & Enable Corp Success.
www.CA.com

Optimize Metadata
Leverage metadata to achieve data governance goals. Free white paper.
www.informatica.com

End-to-End Metadata Management Software
Data Advantage Group's Metacenter repository is an end-to-end Metadata Management Solution for enterprise systems, including unlimited metadata customization, data lineage, impact analysis.
www.dag.com

Metadata at Amazon
Buy books at Amazon.com and save. Qualified orders over $25 ship free.
Amazon.com/books

Metadata Files
Find and Compare prices on metadata files at Smarter.com.
www.smarter.com

Pleaseerase
Server based Ms Word meta data scrubber designed for intranets. Cleans comments, tracked changes, hidden text, standard and custom fields.
www.pleasetech.com

Metadata
Metadata Save on exactly what you need.
www.half.com

Metadata - Wikipedia, the free encyclopedia
An item of metadata may describe an individual datum, or content item, or a ... be used with caution as all data is about something, and is therefore metadata. ...
en.wikipedia.org

Frequently Asked Questions on Metadata
What tools are available to check the structure of metadata? ... But I have already been using a template for metadata that mp can't read. ...
geology.usgs.gov

Understanding Metadata
retrieval, re-purposing, storage, metadata, and preservation. NISO ... the library environment, metadata. is commonly used for any formal ...
www.niso.org




Warning: mkdir() [function.mkdir]: Permission denied in /home/webs/affiliatelib2/CacheManager.php on line 12

Warning: mkdir() [function.mkdir]: No such file or directory in /home/webs/affiliatelib2/CacheManager.php on line 12

Warning: fopen(/home/templatecore2cache//*cluesnet.com/d1/d1aff20a3fc552ad0a1ddc613ead4cdc2ceade9b.tc2cache) [function.fopen]: failed to open stream: No such file or directory in /home/webs/affiliatelib2/CacheManager.php on line 130

Warning: fwrite(): supplied argument is not a valid stream resource in /home/webs/affiliatelib2/CacheManager.php on line 131

Warning: fclose(): supplied argument is not a valid stream resource in /home/webs/affiliatelib2/CacheManager.php on line 132



Metadata is data about data. An item of metadata may describe an individual datum, or content item, or a collection of data including multiple content items.

Metadata (sometimes written 'meta data') is used to facilitate the understanding, use and management of data. The metadata required for effective data management varies with the type of data and context of use. In a library, where the data is the content of the titles stocked, metadata about a title would typically include a description of the content, the author, the publication date and the physical location. In the context of a camera, where the data is the photographic image, metadata would typically include the date the photograph was taken and details of the camera settings. On a portable music player such as an Apple iPod, the album names, song titles and album art embedded in the music files are used to generate the artist and song listings, and are metadata. In the context of an information system, where the data is the content of the computer files, metadata about an individual data item would typically include the name of the field and its length. Metadata about a collection of data items, a computer file, might typically include the name of the file, the type of file and the name of the data administrator.

What is metadata? Any item of data is a description of something. Metadata is a type of data where the something being described is data. Or, as it is often put, metadata is data about data.If we consider a particular place in the real world, this may be described by many items of data, for example:

To make sense of and use this data, it is necessary to have access to some form of description of the sort of data it is, or, in other words, have access to its metadata. So, for example, the metadata for the above three items of data might include:

An item of metadata is itself data and therefore may have its own metadata. This might (not particularly usefully) be referred to as meta-metadata. So, for example, “Post Code” might have the following metadata:

“27th June 2006” might have the following metadata:

The hierarchy of data, metadata, meta-metadata etc. can go on forever. Fortunately we have sufficient background knowledge so that we can usually make sense of and use an item of data with access to very little, if any, formally defined metadata. So, for example, with the “Post Code” metadata “8 characters, starting with A – Z” , it would be possible using background knowledge to know that this is a description of the format of a post code, without having access to any defined metadata for “8 characters, starting with A – Z”.

Levels As indicated, there are hierarchies of data and metadata. However, any particular item of data may be on different levels of a hierarchy depending on the context. For example, when considering the geography of London, “E83BJ” would be data and “Post Code” would be metadata. But, when considering the data management of an automated system that manages geographical data, “Post Code” might be data and then “data item name” and “8 characters, starting with A – Z” would be metadata.

In any particular context, metadata must be at a higher level of abstraction than the data it is describing. So, in relation to “E83BJ”, the item of data “is in London” is a further description of the place in the real world which has the post code “E83BJ” and is at the same level of abstraction. Therefore, although it is providing information about “E83BJ” (It is telling us that this is the post code of a place in London) this would not normally be considered metadata, as it is describing “E83BJ” qua place in the real world and not qua data.

Definitions The term was introduced intuitively, without a formal definition. Because of that, today there are various definitions. The most common one is the literal translation: Example: "12345" is data, and with no additional context is meaningless. When "12345" is given a meaningful name (metadata) of "ZIP code", one can understand (at least in the United States, and further placing "ZIP code" within the context of a postal address) that "12345" refers to the General Electric plant in Schenectady, New York.

As for most people the difference between data and information is merely a philosophical one of no relevance in practical use, other definitions are:

There are more sophisticated definitions, such as: These are used more rarely because they tend to concentrate on one purpose of metadata — to find "objects", "entities" or "resources" — and ignore others, such as using metadata to optimize data compression, or to perform additional computations using the data.

The metadata concept has been extended into the world of systems to include any "data about data": the names of tables, columns, programs, and the like. Different views of this "system metadata" are detailed below, but beyond that is the recognition that metadata can describe all aspects of systems: data, activities, people and organizations involved, locations of data and processes, access methods, limitations, timing and events, as well as motivation and rules.

Fundamentally, then, metadata is "the data that describe the structure and workings of an organization's use of information, and which describe the systems it uses to manage that information". To do a model of metadata is to do an "Enterprise modeling" of the information technology industry itself.William R. Durrell, Data Administration: A Practical Guide to Data Administration, McGraw-Hill, 1985

Hierarchies of metadata When structured into a hierarchical arrangement, metadata is more properly called an Ontology (computer science) or schema. Both terms describe "what exists" for some purpose or to enable some action. For instance, the arrangement of subject headings in a library catalog serves not only as a guide to finding books on a particular subject in the stacks, but also as a guide to what subjects "exist" in the library's own ontology and how more specialized topics are related to or derived from the more general subject headings.

Metadata is frequently stored in a central location and used to help organizations standardize their data. This information is typically stored in a metadata registry.

Difference between data and metadata Usually it is not possible to distinguish between (raw) data and metadata because:

These considerations apply no matter which of the above definitions is considered.

Use Metadata has many different applications; this section lists some of the most common.

Metadata is used to speed up and enrich searching for resources. In general, search queries using metadata can save users from performing more complex filter operations manually. It is now common for web browsers (with the notable exception of Mozilla Firefox), P2P applications and media management software to automatically download and locally cache metadata, to improve the speed at which files can be accessed and searched .

Metadata may also be associated to files manually. This is often the case with documents which are scanned into a document storage repository such as FileNet or Documentum. Once the documents have been converted into an electronic format a user brings the image up in a viewer application, manually reads the document and keys values into an online application to be stored in a metadata repository.

Metadata provide additional information to users of the data it describes. This information may be descriptive ("These pictures were taken by children in the school's third grade class.") or algorithmic ("Checksum=139F").

Metadata helps to bridge the semantic gap. By telling a computer how data items are related and how these relations can be evaluated automatically, it becomes possible to process even more complex filter and search operations. For example, if a search engine understands that "Van Gogh" was a "Dutch painter", it can answer a search query on "Dutch painters" with a link to a web page about Vincent Van Gogh, although the exact words "Dutch painters" never occur on that page. This approach, called knowledge representation, is of special interest to the semantic web and artificial intelligence.

Certain metadata is designed to optimize lossy data compression. For example, if a video has metadata that allows a computer to tell foreground from background, the latter can be compressed more aggressively to achieve a higher compression rate.

Some metadata is intended to enable variable content presentation. For example, if a picture has metadata that indicates the most important region — the one where there is a person — an image viewer on a small screen, such as on a mobile phone's, can narrow the picture to that region and thus show the user the most interesting details. A similar kind of metadata is intended to allow blind people to access diagrams and pictures, by converting them for special output devices or reading their description using speech synthesis software.

Other descriptive metadata can be used to automate workflows. For example, if a "smart" software tool knows content and structure of data, it can convert it automatically and pass it to another "smart" tool as input. As a result, users save the many cut, copy and paste operations required when analyzing data with "dumb" tools.

Metadata is becoming an increasingly important part of electronic discovery. Application and file system metadata derived from electronic documents and files can be important evidence. Recent changes to the Federal Rules of Civil Procedure make metadata routinely discoverable as part of Civil law (common law). Parties to litigation are required to maintain and produce metadata as part of discovery (law), and spoliation of evidence of metadata can lead to sanctions.

Metadata has become important on the World Wide Web because of the need to find useful information from the mass of information available. Manually-created metadata adds value because it ensures consistency. If a web page about a certain topic contains a word or phrase, then all web pages about that topic should contain that same word or phrase. Metadata also ensures variety, so that if a topic goes by two names each will be used. For example, an article about "sport utility vehicles" would also be tag (metadata) "4 wheel drives", "4WDs" and "four wheel drives", as this is how SUVs are known in some countries.

Examples of metadata for an Compact Disc include the MusicBrainz project and All Media Guide's All Music Guide. Similarly, MP3 files have metadata tags in a format called ID3.

Types of metadata Metadata can be classified by:

Important issues To successfully develop and use metadata, several important issues should be treated with care:

Metadata risks Microsoft Office files include metadata beyond their printable content, such as the original author's name, the creation date of the document, and the amount of time spent editing it. Unintentional disclosure can be awkward or even raise malpractice concerns. Some of Microsoft Office document's metadata can be seen by clicking File then Properties from the program's menu. Other metadata is not visible except through external analysis of a file, such as is done in forensics. The author of the Microsoft Word-based Melissa (computer worm) computer virus in 1999 was caught due to Word metadata that uniquely identified the computer used to create the original infected document.

Metadata lifecycle Even in the early phases of planning and designing it is necessary to keep track of all metadata created. It is not economical to start attaching metadata only after the production process has been completed. For example, if metadata created by a digital camera at recording time is not stored immediately, it may have to be restored afterwards manually with great effort. Therefore, it is necessary for different groups of resource producers to cooperate using compatible methods and standards.

Storage Metadata can be stored either internally, in the same file as the data, or externally, in a separate file. Both ways have advantages and disadvantages:

Moreover, there is the question of data format: storing metadata in a human-readable format such as XML can be useful because users can understand and edit it without specialized tools. On the other hand, these formats are not optimized for storage capacity; it may be useful to store metadata in a binary, non-human-readable format instead to speed up transfer and save memory.

Criticisms Although the majority of computer scientists see metadata as a chance for better interoperability, some critics argue:

The opposers of metadata sometimes use the term metacrap to refer to the unsolved problems of metadata in some scenarios.

Types In general, there are two distinct classes of metadata: structural or control metadata and guide metadata.Bretherton, F. P. and Singley, P. T. 1994, Metadata: A User's View, Proceedings of the International Conference on Very Large Data Bases (VLDB), 1091-1094 Structural metadata is used to describe the structure of computer systems such as tables, columns and indexes. Guide metadata is used to help humans find specific items and is usually expressed as a set of keywords in a natural language.

Metatadata can be divided into 3 distinct categories:



Relational database metadata Each relational database system has its own mechanisms for storing metadata. Examples of relational-database metadata include:

In database terminology, this set of metadata is referred to as the database catalog. The SQL standard specifies a uniform means to access the catalog, called the INFORMATION_SCHEMA, but not all databases implement it, even if they implement other aspects of the SQL standard. For an example of database-specific metadata access methods, see Oracle metadata.

Data warehouse metadata Data warehouse metadata systems are sometimes separated into two sections:
  • back room metadata that are used for Extract, transform, load functions to get OLTP data into a data warehouse
  • front room metadata that are used to label screens and create reports


  • KimballRalph Kimball, The Data Warehouse Lifecycle Toolkit, Wiley, 1998, ISBN 0-471-25547-5 lists the following types of metadata in a data warehouse (See also ):

    Michael Bracket defines metadata (what he calls "Data resource data") as "any data about the organization’s data resource".Guy V Tozer, Metadata Management for Information Control and Business Success, Artech House, 1999, ISBN 0-89006-280-3 Adrienne Tannenbaum defines metadata as "the detailed description of instance data. The format and characteristics of populated instance data: instances and values, dependent on the role of the metadata recipient".Adrienne Tannenbaum, Metadata Solutions: Using Metamodels, Repositories, XML, and Enterprise Portals to Generate Information on Demand, Addison-Wesley, 2002, ISBN 0-201-71976-2 These definitions are characteristic of the "data about data" definition.

    Business Intelligence metadata Business Intelligence is the process of analyzing large amounts of corporate data, usually stored in large databases such as the Data Warehouse, tracking business performance, detecting patterns and trends, and helping enterprise business users make better decisions.Business Intelligence metadata describes how data is queried, filtered, analyzed, and displayedin Business Intelligence software tools, such as Reporting tools, OLAP tools, Data Mining tools.

    Examples:

    Business Intelligence metadata can be used to understand how corporate financial reports reported to Wall Street are calculated, how the revenue, expense and profit are aggregated from individual sales transactions stored in the data warehouse.A good understanding of Business Intelligence metadata is required to solve complex problems such as compliance with corporate governance standards, such as Sarbanes Oxley (SOX) or Basel II.

    General IT metadata In contrast, David Marco, another metadata theorist, defines metadata as "all physical data and knowledge from inside and outside an organization, including information about the physical data, technical and business processes, rules and constraints of the data, and structures of the data used by a corporation."David Marco, Building and Managing the Meta Data Repository: A Full Lifecycle Guide, Wiley, 2000, ISBN 0-471-35523-2 Others have included web services, systems and interfaces. In fact, the entire Zachman framework (see Enterprise Architecture) can be represented as metadata.David C. Hay, Data Model Patterns: A Metadata Map, Morgan Kaufman, 2006, ISBN 0-12-088798-3

    Notice that such definitions expand metadata's scope considerably, to encompass most or all of the data required by the Management Information Systems capability. In this sense, the concept of metadata has significant overlaps with the ITIL concept of a Configuration Management Database (CMDB), and also with disciplines such as Enterprise Architecture and IT portfolio management.

    This broader definition of metadata has precedent. Third generation corporate repository products (such as those eventually merged into the CA Advantage line) not only store information about data definitions (COBOL copybooks, DBMS schema), but also about the programs accessing those data structures, and the Job Control Language and batch job infrastructure dependencies as well. These products (some of which are still in production) can provide a very complete picture of a mainframe computing environment, supporting exactly the kinds of impact analysis required for ITIL-based processes such as ITIL#Incident Management and Change Management (ITIL). The ITIL Back Catalogue includes the Data Management volume which recognizes the role of these metadata products on the mainframe, posing the CMDB as the distributed computing equivalent. CMDB vendors however have generally not expanded their scope to include data definitions, and metadata solutions are also available in the distributed world. Determining the appropriate role and scope for each is thus a challenge for large IT organizations requiring the services of both.

    Since metadata is pervasive, centralized attempts at tracking it need to focus on the most highly leveraged assets. Enterprise Assets may only constitute a small percentage of the entire IT portfolio.

    Some practitioners have successfully managed IT metadata using the Dublin Core metamodel.R. Todd Stephens (2003). Utilizing Metadata as a Knowledge Communication Tool. Proceedings of the International Professional Communication Conference 2004. Minneapolis, MN: Institute of Electrical and Electronics Engineers, Inc.

    IT metadata management products First generation data dictionary/metadata repository tools would be those only supporting a specific DBMS, such as IDMS's IDD (integrated data dictionary), the Information Management System Data Dictionary, and Adabas's Predict.

    Second generation would be ASG's DATAMANAGER product which could support many different file and DBMS types.

    Third generation repository products became briefly popular in the early 1990s along with the rise of widespread use of RDBMS engines such as IBM's IBM DB2.

    Fourth generation products link the repository with more Extract, transform, load tools and can be connected with architectural modeling tools. Examples include Adaptive Metadata Manager from Adaptive, Rochade from ASG, InfoLibrarian Metadata Integration Framework and Troux Technologies Metis Server product.

    File system metadata Nearly all file systems keep metadata about files out-of-band. Some systems keep metadata in directory (file systems) entries; others in specialized structure like inodes or even in the name of a file. Metadata can range from simple timestamps, mode bits, and other special-purpose information used by the implementation itself, to icon (computing)s and free-text comments, to arbitrary attribute-value pairs.

    With more complex and open-ended metadata, it becomes useful to search for files based on the metadata contents. The Unix find utility was an early example, although inefficient when scanning hundreds of thousands of files on a modern computer system. Apple Computer's current version of its Mac OS X operating system (Mac OS X v10.4) supports cataloguing and searching for file metadata through a feature known as Spotlight (software). Microsoft worked in the development of similar functionality with the Instant Search system in Windows Vista, as well as being present in SharePoint Server. Linux implements file metadata using extended file attributes.

    Image metadata Examples of image files containing metadata include Exchangeable image file format and Tagged Image File Format.

    Having metadata about images embedded in TIFF or EXIF files is one way of acquiring additional data about an image. Image metadata are attained through Tag (metadata). Tagging pictures with subjects, related emotions, and other descriptive phrases helps Internet users find pictures easily rather than having to search through entire image collections. A prime example of an image tagging service is Flickr, where users upload images and then describe the contents. Other patrons of the site can then search for those tags. Flickr uses a folksonomy: a free-text keyword system in which the community defines the vocabulary through use rather than through a controlled vocabulary.

    Digital photography is increasingly making use of metadata tags. Photographers shooting RAW image format file formats can use applications such as Adobe Bridge or Apple Computer's Aperture (photography software) to work with camera metadata for post-processing. Users can also tag photos for organization purposes using Adobe's Extensible Metadata Platform language, for example.

    Program metadata Metadata is casually used to describe the controlling data used in software architectures that are more abstract or configurable. Most executable formats include what may be termed "metadata" that specifies certain, usually configurable, behavioral runtime characteristics. However, it is difficult if not impossible to precisely distinguish program "metadata" from general aspects of Von Neumann architecture; if the machine reads it and acts upon it, it is a computational Instruction (computer science), and the prefix "meta" has little significance.

    In Java (programming language), the Class (file format) contains metadata used by the Java compiler and the Java virtual machine to dynamic linking class (computer science) and to support reflection (computer science). The J2SE 5.0 version of Java included a metadata facility for Java to allow additional annotations that are used by development tools.

    In MS-DOS, the COM file format does not include metadata, while the EXE and Windows Portable Executable formats do. These metadata can include the company that published the program, the date the program was created, the version number and more.

    In the .NET Framework executable format, extra metadata is included to allow Reflection (computer science) at runtime.

    Existing software metadata Object Management Group (OMG) has defined metadata format for representing entire existing applications for the purposes of software mining, software modernization and software assurance. This specification, called the OMG Knowledge Discovery Metamodel (KDM) is the OMG's foundation for "modeling in reverse". KDM is a common language-independednt intermediate representation that provides an integrated view of an entire enterpise application, including its behavior (program flow), data, and structure. One of the applications of KDM is Business Rules Mining.

    Knowledge Discovery Metamodel includes a fine grained low-level representation (called "micro KDM"), suitable for performing static analysis of programs.

    Document metadata Most programs that create documents, including Microsoft SharePoint, Microsoft Office Word and other Microsoft Office products, save metadata with the document files. These metadata can contain the name of the person who created the file (obtained from the operating system), the name of the person who last edited the file, how many times the file has been printed, and even how many revisions have been made on the file. Other saved material, such as deleted text (saved in case of an undelete command), document comments and the like, is also commonly referred to as "metadata", and the inadvertent inclusion of this material in distributed files has sometimes led to undesirable disclosures.

    Document Metadata is particularly important in legal environments where litigation can request this sensitive information (metadata) which can include many elements of private detrimental data. This data has been linked to multiple lawsuits that have got corporations into legal complications.

    Many legal firms today use "Metadata Management Software", also known as "Metadata Removal Tools". This software can be used to clean documents before they are sent outside of their firm. This process, known as metadata management, protects lawfirms from potentially unsafe leaking of sensitive data through Electronic Discovery.

    For a list of executable formats, see object file.

    Metamodels Metadata on Models are called Metamodels. In Model Driven Engineering, a Model (abstract) has to conform to a given Metamodel. According to the model-driven architecture guide, a metamodel is a model and each model conforms to a given metamodel. Meta-modeling allows strict and agile automatic processing of models and metamodels.

    The Object Management Group (OMG) defines 4 layers of meta-modeling. Each level of modeling is defined, validated by the next layer:

    Strange metadata Since metadata are also data, it is possible to have metadata of metadata–"meta-metadata." Machine-generated meta-metadata, such as the reversed index created by a free-text search engine, is generally not considered metadata, though.

    Metadata that are embedded with content is called embedded metadata. A data repository typically stores the metadata detached from the data.

    Digital library metadata There are three categories of metadata that are frequently used to describe objects in a digital library :
  • descriptive - Information describing the intellectual content of the object, such as MARC cataloguing records, finding aids or similar schemes. It is typically used for bibliographic purposes and for search and retrieval.
  • structural - Information that ties each object to others to make up logical units (e.g., information that relates individual images of pages from a book to the others that make up the book).
  • administrative - Information used to manage the object or control access to it. This may include information on how it was scanned, its storage format, copyright and licensing information, and information necessary for the digital preservation of the digital objects.


  • Geospatial metadata Metadata that describe geographic objects (such as datasets, maps, features, or simply documents with a geospatial component) have a history going back to at least 1994 (refer MIT Library page on FGDC Metadata). This class of metadata is described more fully on the Geospatial metadata page.

    See also

    References

    External links



    DC-dot
    A CGI based Dublin Core metadata generator ... DC-dot news. DC-dot now conforms with the Expressing Dublin Core in HTML/XHTML meta and link elements recommendation.

    UKOLN | Metadata
    Resources from the UK Office for Library and Information Networking

    metadata technology
    If your organisation is involved with business reporting or statistics then the Statistical Data and Metadata Exchange (SDMX) standards could save you time, money, and resources.

    GovTalk – Metadata
    Tranformational Government is part of the Cabinet Office and works to ensure that IT supports the business transformation of government itself so that government can provide better ...

    TASI :: Advice - Delivering Digital Images | Metadata Overview
    TASI - Metadata Overview ... Metadata Overview Contents. Introduction; What is metadata? Where does metadata come from? Where is metadata kept? Conclusion

    Curation Manual: Metadata
    View this Instalment: PDF (256KB) This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 2.5 UK: Scotland [external] licence.

    Metadata - Wikipedia, the free encyclopedia
    Metadata (meta data, or sometimes metainformation) is "data about data", of any sort in any media. An item of metadata may describe an individual datum, or content item, or a ...

    The Metadata Company from FOLDOC
    The Metadata Company < company > A company founded by Jack Myers, originally known as Metadata Information Partners. http://metadata.com/. E-mail: .

    CETIS Metadata and Digital Repository SIG
    Please note: this site is no longer updated. For the current Metadata and Digital Repository SIG website, please visit http://wiki.cetis.ac.uk/Metadata.

    MWI: Metadata Workflow Investigation
    MWI: Metadata Workflow Investigation . Home. About. Resources. Contact. This web site is not maintained. MWI © 2004 . Metadata Workflow Investigation is study of ...





     
    Copyright © 2008 opini8.com - All rights reserved.
    Home | Terms of Use | Privacy Policy
    All Trademarks belong to their repective owners.
    Many aspects of this page are used under
    commercial commons license from Yahoo!