Difference between revisions of "invisible-data-considered-harmful"

From Microformats Wiki
Jump to navigation Jump to search
m (Replace <entry-title> with {{DISPLAYTITLE:}})
Line 1: Line 1:
<entry-title>invisible data considered harmful</entry-title>
{{DISPLAYTITLE:invisible data considered harmful}}
'''This article is a stub.'''
'''This article is a stub.'''

Latest revision as of 16:28, 18 July 2020

This article is a stub.

Invisible data or metadata for that matter is undesirable. The general problem with invisible data is that being invisible, nobody sees when it's wrong. If something is wrong on a web page and it's visible, a visitor can see it, a page author (without technical abilities) can see it. Invisible data doesn't get seen and errors inside it are not found as easily as visible data is.

See Principles of visibility and human friendliness.

Also known as, "dark data", "meta tags", "side files".

invisible metadata failures

  • meta keywords
    • popular in the 1990s, eventually spammed, and rotten, ignored by Google, Yahoo, and other search engines.
  • meta ICBM / geourl
    • people move, don't bother to update their meta ICBM
    • people get it wrong (because it's not obviously visible)
      • swapping lat and long
      • "correcting" negative values to positive (or vice versa)
      • note clusters of sites in the middle of oceans or other open spaces that correlate with inverse (or negated) coordinates of actual cities (see map image on https://web.archive.org/web/20040204183820/http://geourl.org/ and note red point clusters in oceans not on islands)
      • as reported in presentation: http://www.interconnected.org/notes/2004/02/etcon/thu_am_geourl.txt

        funny map. deviantart.com geocoded all of their users. there's a huge diagonal line across the map where people have typed in the same number twice for lat/long.

        there are also densities in the US and europe... but also tibet: the ghost blogs of tibet. it looks like the US, flipped. people transposing lat and long.

  • lang="en"
    • lots of templates, CMS's shipped with this default
    • people used them worldwide
    • now lang="en" is effectively meaningless/untrustworthy since tons of non-en sites all have it (from abovementioned templates etc.)
  • unintentional privacy violations through EXIF data
    • See privacypatterns.org: Strip invisible metadata
    • A number of people have unintentionally released their geolocation through its inclusion in EXIF data attached to photographs uploaded to social networking sites like Twitter. Twitter now strip EXIF data and Flickr allow users to remove EXIF data if needed.