Heterogenous Datasources and Metadata

Dublin Core:

(The following list was extracted from http://purl.org/dc/)

  • Title
    Definition: A name given to the resource.

  • Creator
    Definition: An entity primarily responsible for making the content of the resource.
    Comment: Examples of a Creator include a person, an organisation, or a service.

  • Subject
    Definition: The topic of the content of the resource.
    Comment: Typically, a Subject will be expressed as keywords, key phrases or classification codes that describe a topic of the resource.

  • Description
    Definition: An account of the content of the resource.
    Comment: Description may include but is not limited to: an abstract, table of contents, reference to a graphical representation of content or a free-text account of the content.

  • Publisher
    Definition: An entity responsible for making the resource available Comment: Examples of a Publisher include a person, an organisation, or a service.

  • Contributor
    Definition: An entity responsible for making contributions to the content of the resource.
    Comment: Examples of a Contributor include a person, an organisation, or a service.

  • Date
    Definition: A date associated with an event in the life cycle of the resource.
    Comment: Typically, Date will be associated with the creation or availability of the resource.

  • Type
    Definition: The nature or genre of the content of the resource.
    Comment: Type includes terms describing general categories, functions, genres, or aggregation levels for content.

  • Format
    Definition: The physical or digital manifestation of the resource.
    Comment: Typically, Format may include the media-type or dimensions of the resource. Format may be used to determine the software, hardware or other equipment needed to display or operate the resource. Examples of dimensions include size and duration.

  • Identifier
    Definition: An unambiguous reference to the resource within a given context.
    Comment: Recommended best practice is to identify the resource by means of a string or number conforming to a formal identification system. Example formal identification systems include the Uniform Resource Identifier (URI) (including the Uniform Resource Locator (URL)), the Digital Object Identifier (DOI) and the International Standard Book Number (ISBN).

  • Source
    Definition: A Reference to a resource from which the present resource is derived.
    Comment: The present resource may be derived from the Source resource in whole or in part.

  • Language
    Definition: A language of the intellectual content of the resource.

  • Relation
    Definition: A reference to a related resource.

  • Coverage
    Definition: The extent or scope of the content of the resource.
    Comment: Coverage will typically include spatial location (a place name or geographic coordinates), temporal period (a period label, date, or date range) or jurisdiction (such as a named administrative entity).

  • Rights
    Definition: Information about rights held in and over the resource.
    Comment: Typically, a Rights element will contain a rights management statement for the resource, or reference a service providing such information. Rights information often encompasses Intellectual Property Rights (IPR), Copyright, and various Property Rights. If the Rights element is absent, no assumptions can be made about the status of these and other rights with respect to the resource.

    3 Levels of Internet Protocols:

    1) Networking Protocols : HTML, HTTP

    2) Metadata, Knowledge Exchange, Signature Protocols, Style Sheets

    XML Extensible Markup Language;

    XSL Extensible Style Language, CSS Cascading Style Sheets

    RDF Resource Description Framework; language for writing metadata

    Example:

    <RDF xmlns:DC="http://purl.org/DC">
    <Description about="http://www.w3.org/folio.html">
    <DC:Title>The W3C Folio 1999</DC:Title>
    <DC:Creator>W3C Communications Team</DC:Creator>
    <DC:Date>1999-03-10</DC:Date>
    <DC:Subject>Web development, World Wide Web Consortium, Interoperability of the Web</DC:Subject>
    </Description>
    </RDF>

    3) Social Protocols: Privacy and Content Selection

    P3P Platform for Privacy Preferences; web sites state their privacy practices which user agents match against the preferences defined by the user; based on RDF

    PICS Platform for Internet Content Selection; standard for annotating (rating) webpages according to their content; based on RDF

    Example:

    <META http-equiv="PICS-Label" content='
    (PICS-1.1 "http://www.w3c.org/PICS/vocab.html"
    labels for "http://w3.org/PICS/vocab.html"
    exp "2000.09.04T08:15-0500"
    ratings (q2 v3))'>