The gzipped tar file at dmarchette.com/agxml.tgz contains two files holding the raw XML data for all 1382 English/French documents, one language per file. Note: text in [[ ]], such as [[de:Liga von Cambrai]], marks links to other languages (in this case German), and these pages contain many other markup constructs like this.
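
If the language links need to be pulled out or removed before further processing, something like the following could serve as a starting point. It is only a sketch: the function names and the sample string are made up for illustration, and it assumes a language link always has the form [[xx:Title]] with a short lowercase language code, which may not cover every case in the data.

```python
import re

# Assumption: a language link looks like [[xx:Title]], where xx is a
# two- or three-letter lowercase code, optionally with hyphenated suffixes.
INTERLANG_LINK = re.compile(r"\[\[([a-z]{2,3}(?:-[a-z]+)*):([^\[\]|]+)\]\]")


def find_interlanguage_links(text):
    """Return (language_code, title) pairs for each [[xx:Title]] in text."""
    return [(m.group(1), m.group(2).strip()) for m in INTERLANG_LINK.finditer(text)]


def strip_interlanguage_links(text):
    """Return text with every [[xx:Title]] link removed; other [[ ]] links stay."""
    return INTERLANG_LINK.sub("", text)


# Hypothetical sample, not taken from the actual files.
sample = "Some text [[de:Liga von Cambrai]] more [[Venice]]"
links = find_interlanguage_links(sample)
cleaned = strip_interlanguage_links(sample)
```

Links without a language prefix, such as [[Venice]] in the sample, are left alone, so the other markup mentioned above would still need separate handling.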