I was updating ubuntu studio, but the update didn't work right and /dev got corrupted or something like that. I booted into windows and installed a program that allowed me to mount the linux partition, and copied my /home/ folder out. I had to reinstall from a flash drive, and it took me a lot of tries because I didn't put the bootloader on the right partition.
For future reference, for me: The fat32 partition isn't for the bootloader, it's a dell thing. You also should format the ubuntu partition just in case. Put the bootloader on /dev/sda and it will work.
NSF:
- The program now scrapes http://patents.reedtech.com/parbft.php for all the patent urls, and should iterate through them. It also gives a readout of the amount downloaded for each file.
- I realized that the newer patent applications don't use <patent-application-publication> to denote the xml, but instead <us-patent-application>, which breaks the program as of now.
- Also, not all the zip files have the xml in their root directory. I am trying to figure out a way to iterate through the tree to find the xml file, but os.walk doesn't seem to work and I don't really want to unzip the entire zip directory tree (though I might have to)
- Added more convenience methods to clean up the code a bit