rest/ahah

From Microformats Wiki
Revision as of 14:08, 19 November 2008 by Jonathanjefferies (talk | contribs) (Spelling error.)
Jump to navigation Jump to search

AHAH: Asynchronous HTML and HTTP

AHAH is a very simple technique for dynamically updating web pages using JavaScript. It involves using XMLHTTPRequest to retrieve (X)HTML fragments which are then inserted directly into the web page, whence they can be styled using CSS.

Contributors

Questions

  • Has anyone considered working with browser vendors to have them support client-side includes? This way, markup like <div src="hello-world.txt" /> would just work. This avoids the need for client side script and ensures the browser can correctly manage connections, caching, authentication, etc. For backwards compatibility, you could create a little javascript utility to scan for all occurrences of tags like this, queue up the requests and let the content flow in after the page was loaded. I have a little csi.js utility that does this and it's pretty handy. || Yes, Div Includes is a good concept (fraught with all the usual security issues), but the real power of includes is when they are used dynamically in response to user interaction. For purely static pages it is much more efficient to have the server create them using something like shtml or php. The ability to create/update a Div include via JS is where this proposed technique would be a fantastic improvement on the mess that we have now.
  • Okay, maybe this is a dumb question, but I don't understand why this is called AHAH instead of AJAH. AHAH still uses the same XMLHttpRequest JavaScript functionality as AJAX, so why drop the J? AJAX uses the same HTTP functionality as AHAH, so why add the H? Have I misunderstood what is actually happening? (You are right of course, but Ahah! == Eureka!!! and AJah = ?)

Relation to AJAX

AHAH is intended to be a much simpler way to do web development than AJAX: "Asynchronous JavaScript and XML." Strictly speaking, AHAH can be considered a subset of AJAX, since (X)HTML is just a special kind of XML. However, it is a subset with some very specific and useful properties:

  1. The lack of custom XML schemas dramatically reduces design time
  2. AHAH can trivially reuse existing HTML pages, avoiding the need for a custom web service
  3. All data transport is done via browser-friendly HTML, easing debugging and testing
  4. The HTML is designed to be directly embedded in the page's DOM, eliminating the need for parsing
  5. As HTML, designers can format it using CSS, rather than programmers having to do XSLT transforms
  6. Processing is all done on the server, so the client-side programming is essentially nil (moving opaque bits)

In fact, for any content that is destined to be viewed by the browser, it is virtually impossible to imagine any advantage to sending it as custom XML rather than structurally-correct HTML (with appropriate CSS-friendly class names, of course).

That said, many applications of AJAX are (at least in theory) targeteable at custom JavaScript code or desktop GUIs rather than mere browsers. For those cases, the advantages of HTML over custom XML are somewhat less. However, even here, it may well make sense to encode data using xoxo -- aka XHTML Property Lists -- which can be losslessly converted back and forth from standard data structures (lists and dictionaries) without the need for custom parsers.

Source Code

Unlike the various libraries (e.g., JSON, MochiKit) important for AJAX, all of AHAH is contained in a single JavaScript file (also available as ahah.js and jah.js). In fact, this is little more than the canonical XMLHttpRequest example, and is simple enough for any modern web designer to embed within their existing web pages.

NOTE: The example ahah.js mentioned here has the unfortunate requirement (if the delay argument is not undefined) that url,target,delay must be global variables (which makes one wonder why they are passed as parameters....). If you don't want to use globals, and want this to actually work, use something like: setTimeout( 'ahah( "' + url + '", "' + target + '", ' + delay + ')', delay ); Fair warning.

Send AHAH Request

function ahah(url,target) {
    document.getElementById(target).innerHTML = 'loading data...';
    if (window.XMLHttpRequest) {
        req = new XMLHttpRequest();
        req.onreadystatechange = function() {ahahDone(target);};
        req.open("GET", url, true);
        req.send(null);
    } else if (window.ActiveXObject) {
        req = new ActiveXObject("Microsoft.XMLHTTP");
        if (req) {
            req.onreadystatechange = function() {ahahDone(target);};
            req.open("GET", url, true);
            req.send();
        }
    }
} 

Note the workaround needed for IE's ActiveX implementation. The current version hard-codes GET; there may be value in adding an extra parameter to allow POST, PUT, and DELETE.

Receive AHAH Request

function ahahDone(target) {
   // only if req is "loaded"
   if (req.readyState == 4) {
       // only if "OK"
       if (req.status == 200 || req.status == 304) {
           results = req.responseText;
           document.getElementById(target).innerHTML = results;
       } else {
           document.getElementById(target).innerHTML="ahah error:\n" +
               req.statusText;
       }
   }
}

Other than error checking, the only real work done is:

document.getElementById(target).innerHTML = results;

Everything else (e.g., CSS-styling) is merely inherited from the parent webpage.

Executing Javascript

Since the browser won't execute <script> tags when changing innerHTML, you may want to apply the following function call on document.getElementById(target):

var bSaf = (navigator.userAgent.indexOf('Safari') != -1);
var bOpera = (navigator.userAgent.indexOf('Opera') != -1);
var bMoz = (navigator.appName == 'Netscape');
function execJS(node) {
  var st = node.getElementsByTagName('SCRIPT');
  var strExec;
  for(var i=0;i<st.length; i++) {     
    if (bSaf) {
      strExec = st[i].innerHTML;
    }
    else if (bOpera) {
      strExec = st[i].text;
    }
    else if (bMoz) {
      strExec = st[i].textContent;
    }
    else {
      strExec = st[i].text;
    }
    try {
      eval(strExec.split("<!--").join("").split("-->").join(""));
    } catch(e) {
      alert(e);
    }
  }
}

JavaScript Notes:

  • IE always returns tags in UPPER CASE, so you must search for SCRIPT and not script. Keep your SCRIPT tags in upper case for Firefox compatibility.
  • Do not use // comments, use /* */ instead. The eval executes all your javascript as though it was on one line, so you must end each statement with a semicolon (;).
  • Any functions declared in your JavaScript will go out of scope once the eval statement completes. It is possible to keep them in scope by attaching them as a method to a function that was defined at the document level.

Indexing

Another advantage of AHAH is that the dynamic XHTML content can be easily indexed by search engines; this avoids the need to inline all the dynamic content as hidden divs, which would increase page load times.

The current best practice for doing this is to:

  • included <link> tags in <head> of the parent page, to reference the various URLs retrieved by AHAH
  • include <redirects> in the outer HTML of the AHAH page, so that search hits go to an appropriate anchor on the parent page

It is possible that some crawlers will automatically index the URLs in the JavaScript calls, if recognized as such (e.g., due to the "html" extension, or if it is an absolute URL), though it is not clear how well this would work.

Meta tags

One odd characteristic of responseText is that it appears to preserve 'meta' and 'link' tags (though apparently not 'head' itself). This would potentially pollute the browser with illegal HTML, though since they're empty tags it shouldn't affect rendering, and any real-world browsers can be counted on to safely ignore it. One might want to add JavaScript to strip out the extra data, but that would likely be more overhead than it is worth (unless there is sensitive data in there you don't want curious source-viewers to see).

On the flip side, though, the preservation of meta tags allows the sending of additional key-value pairs, a la JSON. True, it doesn't support more complex data structures, but in many cases it will suffice, and it avoids the security concerns. For more complex metadata, it is probably better to use xoxo (perhaps with JSON conventions).

History

"AHAH" as a formal technique appears to have been introduced by Kevin Marks on May 12, 2005 under the name JAH: "Just Asynchronous HTML", where it was also used in a simple example. The term "AHAH" was proposed by Ernest Prabhakar during the 2005 Web 2.0 conference, and later adopted as part of the REST-Enabled XHTML (REX) microformat for web services.

David Hansson had independently discovered the exact same concept, and in fact had already submitted an abstract about it for O'Reilly's 2006 E-Tech conference when he encountered the work done by Marks and Prabhakar. He had not however named the technique, and quickly agreed to adopt the AHAH moniker. The same concept has no doubt been independently discovered by others, but these three appear to be the first to make a sustained attempt to promote it as a formal technique.

Implementations