Friday, February 14, 2014

Changes 2/13/14 and 2/14/14

I was updating ubuntu studio, but the update didn't work right and /dev got corrupted or something like that.  I booted into windows and installed a program that allowed me to mount the linux partition, and copied my /home/ folder out.  I had to reinstall from a flash drive, and it took me a lot of tries because I didn't put the bootloader on the right partition.

For future reference, for me: The fat32 partition isn't for the bootloader, it's a dell thing. You also should format the ubuntu partition just in case. Put the bootloader on /dev/sda and it will work.

NSF:

  • The program now scrapes http://patents.reedtech.com/parbft.php for all the patent urls, and should iterate through them.  It also gives a readout of the amount downloaded for each file.
  • I realized that the newer patent applications don't use <patent-application-publication> to denote the xml, but instead <us-patent-application>, which breaks the program as of now.
  • Also, not all the zip files have the xml in their root directory.  I am trying to figure out a way to iterate through the tree to find the xml file, but os.walk doesn't seem to work and I don't really want to unzip the entire zip directory tree (though I might have to)
  • Added more convenience methods to clean up the code a bit

1 comment:

  1. I'm here at the NSF this weekend so I've had the chance to talk to both Michelle and Randy. She is going to create some milestones for you (which is why I just emailed the time line).

    ReplyDelete