Genealogy Formats

(Difference between revisions)

Jump to: navigation, search
m (related-pages template)
Current revision (15:30, 25 July 2013) (view source)
(subheads, fix typo)
 
(8 intermediate revisions not shown.)
Line 1: Line 1:
-
= Genealogy Formats =
+
<entry-title> Genealogy Formats </entry-title>
-
I started this page because someone (Bob Jonkman apparently) added a bunch of stuff to the [http://developers.technorati.com/wiki/MicroFormats Technorati microformats page] on genealogy, and I moved it here. -[http://tantek.com/log/ Tantek]
+
Per the microformats [[process]], towards the development of a [[genealogy]] microformat, this page documents previous/existing genealogy related formats.
-
 
+
-
 
+
-
== In the wild ==
+
-
 
+
-
see: The Dring tree [http://www.sussexbarn.com/dring/web/dring/pafg01.htm] for an interesting family tree website.
+
-
 
+
-
[http://www.comp.utas.edu.au/users/rsmith/levett/wc01/wc01_045.html this family group] is pretty much a direct translation of a gedcom FAM structure, but with some names added to the links. It also includes back links to parents.
+
-
 
+
-
[http://www.comp.utas.edu.au/users/rsmith/levett/ps02/ps02_361.html an individual from the same tree] This is basically an INDI record from GEDCOM.
+
-
 
+
-
== Problem Statement ==
+
-
The main problem for genealogy on the web is that many people are posting their family trees, but if you were searching for your ancestors, there is no semantic in these pages which helps you link them to similar named individuals in your own tree. Some sites like FreeCEN and FreeBMD have databases which can assist in this linkage, but they are incomplete and frustrating to use.
+
-
 
+
-
If there were some kind of order to this process, ordinary web searching might be used; and we could interlink family trees more readily.
+
-
 
+
-
[http://jay.askren.net/Projects/SemWeb/ RDF] and the semantic web has been used to tackle this problem, but this doesnt help people that want to publish, or search published trees until there is a real semantic web.
+
-
 
+
-
What I think we need is some kind of microformat markup to add to examples like [http://jay.askren.net/Projects/SemWeb/FamilyTrees/AbrahamLincoln.html this tree of Abraham Lincoln].
+
== GEDCOM ==
== GEDCOM ==
Line 28: Line 10:
* I'm not sure whether it makes sense to do GEDCOM as its own format, the FAM structure and the need to present different reports, suggest to me that we need some kind of post-GEDCOM markup. To see how direct use of GEDCOM might pan out I hacked up this [[GEDCOM Worked example]]. To me the main issue seems to revolve around the FAM structure. I think the [http://jay.askren.net/Projects/SemWeb/ Jay Askren] approach might be better than the Gene Stark work as a starting point.
* I'm not sure whether it makes sense to do GEDCOM as its own format, the FAM structure and the need to present different reports, suggest to me that we need some kind of post-GEDCOM markup. To see how direct use of GEDCOM might pan out I hacked up this [[GEDCOM Worked example]]. To me the main issue seems to revolve around the FAM structure. I think the [http://jay.askren.net/Projects/SemWeb/ Jay Askren] approach might be better than the Gene Stark work as a starting point.
-
 
* Had a look at some examples of what GEDCOM creates [http://en.wikipedia.org/wiki/GEDCOM#Example].  Basically, seems to be [[xfn|XFN]] relationships (siblings, spouses etc.) and [[hcard|hCard]] information (could genealogy be inferred from existing XFNs regardless of a hGED format?). The only additional information we do not currently hold in a format is that of gender. GEDCOM specifies male or female for each individual. Creating something using these formats would be quite straightforward, but not sure its takeup would be good unless someone was interested in creating a hGEDCOM2GEDCOM. -- [[user:Phae|Frances Berriman]]
* Had a look at some examples of what GEDCOM creates [http://en.wikipedia.org/wiki/GEDCOM#Example].  Basically, seems to be [[xfn|XFN]] relationships (siblings, spouses etc.) and [[hcard|hCard]] information (could genealogy be inferred from existing XFNs regardless of a hGED format?). The only additional information we do not currently hold in a format is that of gender. GEDCOM specifies male or female for each individual. Creating something using these formats would be quite straightforward, but not sure its takeup would be good unless someone was interested in creating a hGEDCOM2GEDCOM. -- [[user:Phae|Frances Berriman]]
Line 44: Line 25:
::[[User:Bob Jonkman|Bob Jonkman]] 07:58, 9 Feb 2007 (PST)
::[[User:Bob Jonkman|Bob Jonkman]] 07:58, 9 Feb 2007 (PST)
-
==Wikipedia's Persondata==
+
== GEDCOM Replacement Efforts ==
 +
 
 +
There are currently two major efforts to develop a replacement for the largely out-of-date GEDCOM format (last updated in 1999).
 +
 
 +
=== GEDCOM X ===
 +
One effort is GEDCOM X [http://www.gedcomx.org/], by FamilySearch, the original creator of GEDCOM. While the format is openly published on github and the development is fairly transparent, it is completely controlled by FamilySearch (a division of the Mormon church). Includes JSON and XML serialization formats, as well as a file format which includes many files compressed into a zip file.
 +
 
 +
=== FHSIO ===
 +
The other effort is the Family History Information Standards Organization (FHSIO) [http://fhiso.org/] which is gathering member companies into a consortium to develop a replacement format. Part of the goal of FHISO is specifically to take genealogy standards out of the control of a single organization. FHISO was spawned out of a grass-roots effort to replace GEDCOM called BetterGEDCOM [http://bettergedcom.wikispaces.com/].
 +
 
 +
==Wikipedia Persondata==
Wikipedia's [http://en.wikipedia.org/wiki/Wikipedia:Persondata Persondata] aligns very closely with hCard, but has additional date and place of birth & death fields. [[User:AndyMabbett|Andy Mabbett]] 13:04, 28 Jan 2007 (PST)
Wikipedia's [http://en.wikipedia.org/wiki/Wikipedia:Persondata Persondata] aligns very closely with hCard, but has additional date and place of birth & death fields. [[User:AndyMabbett|Andy Mabbett]] 13:04, 28 Jan 2007 (PST)
 +
 +
==vCard birth death extensions==
 +
http://tools.ietf.org/html/draft-li-vcarddav-vcard-id-property-extensions
 +
 +
This vCard extension draft proposes new properties related to birth location, death date, and death location.
 +
* BIRTHPLACE
 +
* DEATHPLACE
 +
* DEATHDATE
== External Links ==
== External Links ==

Current revision


Per the microformats process, towards the development of a genealogy microformat, this page documents previous/existing genealogy related formats.

Contents

GEDCOM

GEDCOM has become pretty much the defacto standard for sharing data between genealogy systems. It is hierarchical and link based, much like HTML; but it encodes family structure (which is a general graph) outside of this structural hierarchy.
GEDCOM was developed (...) to provide a flexible, uniform format for exchanging computerized genealogical data.[1]
The only relationship links in GEDCOM are HUSBand, WIFE and CHILd. All other relationships (brother, sister, grandparents, grandchildren, uncles, aunts, nieces, nephews, cousins) can be inferred by traversing family records. This does mean that any collection of genealogical pages need some way to cross-reference to each other. This isn't a problem for all pages on a single Web site, which use RIN (Record Identifier) or REFN (User Reference Number). However, different Web pages maintained by different genealogists may have conflicting RINs and REFNs. There is a globally-unique AFN (Ancestral File Number) issued by the Church of Jesus Christ of Latter-Day Saints (LDS), but I don't know how they're issued and most genealogical sites don't use them anyway.
The GEDCOM format contains much other data specific to the LDS, but I don't know how widespread it is, nor how appropriate it would be to code it into a microformat intended to reach well beyond the LDS.
Regardless of whether an hGED microformat is developed, it would still be valuable to mark up genealogical information with microformats on Web pages for the semantic value.
Bob Jonkman 07:58, 9 Feb 2007 (PST)

GEDCOM Replacement Efforts

There are currently two major efforts to develop a replacement for the largely out-of-date GEDCOM format (last updated in 1999).

GEDCOM X

One effort is GEDCOM X [3], by FamilySearch, the original creator of GEDCOM. While the format is openly published on github and the development is fairly transparent, it is completely controlled by FamilySearch (a division of the Mormon church). Includes JSON and XML serialization formats, as well as a file format which includes many files compressed into a zip file.

FHSIO

The other effort is the Family History Information Standards Organization (FHSIO) [4] which is gathering member companies into a consortium to develop a replacement format. Part of the goal of FHISO is specifically to take genealogy standards out of the control of a single organization. FHISO was spawned out of a grass-roots effort to replace GEDCOM called BetterGEDCOM [5].

Wikipedia Persondata

Wikipedia's Persondata aligns very closely with hCard, but has additional date and place of birth & death fields. Andy Mabbett 13:04, 28 Jan 2007 (PST)

vCard birth death extensions

http://tools.ietf.org/html/draft-li-vcarddav-vcard-id-property-extensions

This vCard extension draft proposes new properties related to birth location, death date, and death location.

External Links

See also

Genealogy Formats was last modified: Thursday, July 25th, 2013

Views