JungMin Posted March 8, 2006 Report Share Posted March 8, 2006 (edited) I have searched around, but everything i find seems to run text files to wiki. I am looking for something to convert the other way: wiki to a plain text document. I want to be able to view the text files on my mp3 player. Any ideas??? Edited March 8, 2006 by JungMin Quote Link to comment Share on other sites More sharing options...
Qchem Posted March 8, 2006 Report Share Posted March 8, 2006 I haven't tried either of these to see how feasible they are, but can you not just select all the text on a page and paste it into a text editor? Some web browsers can save as text too. Obviously if you're wanting to automate large chunks this isn't going to be appropriate. Quote Link to comment Share on other sites More sharing options...
JungMin Posted March 8, 2006 Author Report Share Posted March 8, 2006 Yeah, looking to do a large batch. I think it is more complicated than it seemed at first to me. But, hopefully someone has/or stumbled upon a script to do this??? Quote Link to comment Share on other sites More sharing options...
Qchem Posted March 8, 2006 Report Share Posted March 8, 2006 I wonder if you could do something with wget and html2txt?? Quote Link to comment Share on other sites More sharing options...
Steve Scrimpshire Posted March 8, 2006 Report Share Posted March 8, 2006 There is a perl module to handle Wiki -> different formats, found here: http://search.cpan.org/dist/Text-WikiForma...t/WikiFormat.pm But I am no expert in perl, so hopefully, someone can help you write a perl script to do what you wish or if you have the expertise, you can do it yourself. Quote Link to comment Share on other sites More sharing options...
JungMin Posted March 9, 2006 Author Report Share Posted March 9, 2006 Yeah, unfortunately i have zero experience in Perl (and only basic programming skills). I have found a few scripts (modules) that can transform wiki´s, but more code needs to be written. And unfortunately i cant do this. I was hoping there would be something already written.....but its not looking too good. Quote Link to comment Share on other sites More sharing options...
Jza Posted April 15, 2006 Report Share Posted April 15, 2006 Python HTMLParser is a good API process to convert the HTML into standardouput. HTMLParser will just focus on the div that contains the content and parse everything from there on. I did some info parsing from a table which then report me back with just the info (pirice) I needed. Quote Link to comment Share on other sites More sharing options...
Recommended Posts
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.