View previous topic :: View next topic |
Author |
Topic : "converting threads to .doc" |
edraket member
Member # Joined: 18 Sep 2001 Posts: 505 Location: Rotterdam, The Netherlands
|
Posted: Wed Jul 17, 2002 1:30 am |
|
 |
Hey,
There is a whole lot of knowledge on this forum. Lots of old threads with superduper information.
When this server crashes (or the owner decides he needs the space for something else) that can all be lost.
So I'd like to take those threads that I find interesting and convert them into a document of sorts that I can save, print etc.
Can anyone give me some pointers on how to do that? Just pasting it into word doesn't seem to be a good idea. You get lots of junk and it often doesn't fit into the pages. Not to mention the dark blue background.
Is there any converter util out there that can help me do this?
Of course When I get around to doing all this I would also make the files available here. |
|
Back to top |
|
NeoFun member
Member # Joined: 12 Oct 2000 Posts: 263 Location: California
|
Posted: Wed Jul 17, 2002 2:05 am |
|
 |
Why not just save the html files? |
|
Back to top |
|
Sumaleth Administrator
Member # Joined: 30 Oct 1999 Posts: 2898 Location: Australia
|
Posted: Wed Jul 17, 2002 2:22 am |
|
 |
I'd like to backup the whole site. |
|
Back to top |
|
egerie member
Member # Joined: 30 Jul 2000 Posts: 693 Location: Montreal, Canada
|
Posted: Wed Jul 17, 2002 9:51 am |
|
 |
Teleport Pro ? |
|
Back to top |
|
Agrajag member
Member # Joined: 03 Mar 2002 Posts: 93
|
Posted: Wed Jul 17, 2002 11:04 pm |
|
 |
So far I've only saved singular threads by printing them pagewise via some postscript printer driver to one or more .ps-files, then copying them together and distilling them to PDF using GhostScript. That works quite well for me. If you're in the lucky position to own Adobe Acrobat, you can achieve the same (and even better) effect by web-capturing the thread. That way you don't even lose the links. Acrobat's a little costly, though...
I tried to backup some very important threads with Teleport Pro once. But since a lot of pages are served by the CGI-scripts, Teleport didn't do the trick for me, plus it took forever (I'm a modem user, so... ).
It should be possible to export the database into some legible format or other, but all the images would be gone methinks. That's gonna be a problem anyway, especially with old threads: most of the images are not available anymore. But the value of the written information alone is already worth capturing.  |
|
Back to top |
|
edraket member
Member # Joined: 18 Sep 2001 Posts: 505 Location: Rotterdam, The Netherlands
|
Posted: Wed Jul 17, 2002 11:33 pm |
|
 |
I think my work has a copy of acrobat. I'll sniff around.
Sure I can print them btw. But I would like to sort things, distill the good stuff and put it into one doc.
I'll see if the acrobat thing works.
thanks for the info. |
|
Back to top |
|
derm junior member
Member # Joined: 24 Mar 2002 Posts: 37
|
Posted: Thu Jul 18, 2002 12:10 pm |
|
 |
http://www.gnu.org/software/wget/wget.html
http://space.tin.it/computer/hherold/
code
Quote: |
wget -m -E -p -k -np -l 3 -H "http://www.sijun.com/cgi-bin/ultimatebb.cgi?ubb=forum&f=5" |
..should probably work just fine to back up a certain forum fairly well. I don't really want to test it.
code
Quote: |
wget -p -k -E -np -nd -H "http://www.sijun.com/cgi-bin/ultimatebb.cgi?ubb=get_topic&f=5&t=009076" |
Ought to do it for a single thread. I just tested this and it works fine.
<edit>I should add that it is bad to abuse the first type of archival, and as a courtesy to Dhabih should be avoided. This is why I didn't bother extensively testing the first line I gave.
<edit again>tarball and gzip the results of the backup to be extra "l33t."
[ July 18, 2002: Message edited by: derm ] |
|
Back to top |
|
|