[uf-discuss] uF Discovery?
brian.suda at gmail.com
Thu Nov 24 07:57:26 PST 2005
I'm not advocating that something for discovery MUST be built. I am saying
"let's look at how other people have tried to figure this out: has it
worked or failed, and what can we learn from that?"
HTML has been successfully broken down into several pieces, CSS for
Microformats use HTML as the data layer, XMDP as the description layer,
but there is no discovery layer (maybe there doesn't need to be?).
As Luke pointed out:
> Software that uses distributed uF data may want to
> see where all the uF data is and not get caught up
> in crawling the whole web. Context: REST APIs.
> - Luke
The advantage of saying what you have available is that it minimizes the
crawl space. I can get one file that tells me everything, or crawl the
entire 40,000 Avon hCard pages to try to get the same thing.
Look at Google Sitemaps: a single file that describes the
pages on the site, along with each page's last-update time. This helps to
limit unneeded crawls, bandwidth, time, etc.
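
To make that concrete, here is a rough Python sketch of how a consumer
might use such a file to skip pages that have not changed since its last
visit. It is only an illustration, not anything Google or the microformats
community has specified: the example.com URLs, the sample data, and the
function names are made up, and it assumes each entry carries a <loc> and
an optional <lastmod> whose first ten characters are a YYYY-MM-DD date.

```python
import xml.etree.ElementTree as ET
from datetime import date

# Hypothetical sitemap contents; a real consumer would fetch this file
# from the site instead of embedding it.
SAMPLE_SITEMAP = """
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url>
    <loc>http://example.com/people/alice.html</loc>
    <lastmod>2005-11-20</lastmod>
  </url>
  <url>
    <loc>http://example.com/people/bob.html</loc>
    <lastmod>2005-10-01</lastmod>
  </url>
</urlset>
"""


def local_name(tag):
    """Drop the '{namespace}' prefix so any schema version can be read."""
    return tag.rsplit("}", 1)[-1]


def pages_changed_since(sitemap_xml, last_crawl):
    """Return the URLs whose lastmod date is later than last_crawl."""
    root = ET.fromstring(sitemap_xml)
    changed = []
    for url in root:
        fields = {local_name(c.tag): (c.text or "").strip() for c in url}
        loc = fields.get("loc")
        if not loc:
            continue
        lastmod = fields.get("lastmod")
        # Without a lastmod we cannot rule the page out, so fetch it again.
        # Simplification: only the date part of lastmod is compared.
        if not lastmod or date.fromisoformat(lastmod[:10]) > last_crawl:
            changed.append(loc)
    return changed


to_fetch = pages_changed_since(SAMPLE_SITEMAP, date(2005, 11, 1))
```

With a last crawl on 2005-11-01, only alice.html needs to be fetched again;
bob.html is skipped without ever being requested.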
This approach has its downsides as well. Data drift is my biggest concern.