Damn Mediawiki
July 20th, 2005I have been using Mediawiki (the same software used to run Wikipedia) for a private online wiki for some time now, for recording notes and research for my MSc dissertation. But, I’d like to do an HTML dump of all the pages from the current version of the database, for reference when I am offline (in this case, my five-hour train journey to London this weekend). Infruiatingly, there doesn’t seem to be that option - you can export to some specific XML format, but not produce a simple static HTML mirror of the site. The only plugin I’ve found no longer works. Anyone know of a (quick) solution?







July 20th, 2005 at 17:59:17
You could always use wget to splodge a load of pages out into HTML, if everything is linked to.
I often do web development in PHP and then output the final code (if someone requires HTML-only) in this way… works great. You need to have a little script to change all your links from .php to .html, but that’s a single line of sed, usually.
July 20th, 2005 at 19:12:06
Yeah I did think about that but it would take some time (have to make sure the fetch script doesn’t waste time fetching all the extra useless links like “edit this”, help page, random page etc.), and also would miss out on any orphaned pages. As I don’t need every single page on the wiki for definite I might just manually save the dozen or so I really do need… but it would be nice to have the lot.
July 21st, 2005 at 02:24:08
You have MeFi account (and at a vastly lower user number than me, by the by). AskMe is your friend.
Seriously, though, 25,000+ members is a good resource. Use it. Use it hard.