Before I upload and post the link for the archive though, I wanted to make you all aware that the effort to convert to EPUB or PDF has hit a snag.
- PDF is possible if I duplicate the entire download effort once again. Time consuming (maybe two weeks) but feasable. Note that the way this is handled, I would be downloading 3000-4000 pages, at a guess, as it uses the default of 15 messages per web page displayed; the HTML pages were downloaded using 40/page. There is no way to get around this setting. (For those wondering, every webpage of 15 messages/page is roughly 4 PDF pages in length. So this is going to end up being a large PDF file.)
- EPUB seems to be a major problem as it requires manually editing (to an unknown extent as yet) each page's source code. Search and Replace will do 80+% of the work I would guess - but only after I spend a couple of days figuring out what can be stripped and what must remain. Then tweak things a bit for better display.
So, I wanted to ask the TOR Community how critical the ability to search through the entire forum is on the following scale:
1 = Absolutely Vital
2 = Easier, but I can make do
3 = Whatever, I'll find it eventually
4 = Doubtful I'll search the archive
5 = Waste of effort
By my personal standards, it's option 1 = Absolutely Vital. While the existence of the archive means we've got a copy even if catastrophe strikes, without being in an easily searchable format, few users will go back and peruse the information inside.
So, please, enter your vote so I have some idea whether I should put in the required effort. And while you're at it, any preference - PDF or EPUB?