Friday, January 31, 2014

Changes 1/31/14

I have been extremely sick this week, so I haven't been able to blog or really do anything IT related.  I will try to do a bit this weekend (I will try to stay in the Database class since there is stuff due on Monday).

Monday, January 27, 2014

Changes 1/27/14

Databases:

  • Watched more Stanford videos

Thursday, January 23, 2014

Changes 1/23/14

NSF:

  • Worked on CSV generation (not really sure how to format it)

Databasing:

  • Did Relational Algebra stuff over the weekend and started SQL today

HTML/CSS:

  • Updated website so it is no longer horrible

Thursday, January 16, 2014

Changes 1/16/14

Database Class:

  • Finished DTD Exercises. They were kind of annoying and tedious, but I know more about DTDs than I did before, so I guess it was worth it.
  • Watched 1st JSON video

Wednesday, January 15, 2014

Changes 1/15/14

NSF:

  • Fixed the parser.  It now parses the entire xml file and prints the values found on the console.  This is extremely close to intended functionality.  The only thing missing at this point is a few tags, which I couldn't find because these documents want a DTD from 2002, but I have the 2012 version.  I am going to email Michelle about that tomorrow.

Tuesday, January 14, 2014

Changes 1/14/14

NSF:

  • Set up the parsing so that the full xml file is split into smaller segments, with each one containing valid xml and the DTD thing at the top (I don't know what the proper name is)
  • Theoretically this should have fixed everything, but instead it finds every value as being none except for <document-date> which is odd.
  • I'll look at it tomorrow

Monday, January 13, 2014

Changes 1/13/14

NSF:

  • The DTD doesn't reflect the xml file with the PCT fields. In the xml, the PCT is under <foreign-priority-data>, but in the DTD it instead points to <pct-or-regional-filing-data> as containing PCT data.  The DTD I'm using is called "DTD US Patent Application Publications v43 2012-12-04."  
  • The DTD also makes no mention of Prior Publication Data

Databasing:

  • Took the first quiz, XML DTDS and Schema, and got a 7 out of 8. I forgot to check whether the IDREF attributes actually had corresponding IDs on one of the questions, so I got that one wrong.

Changes 1/10/14

NSF:

  • Since the xml is redefined several times in the patent application document, the parser only seems to be able to parse from within the first <patent-application-publication> element.  I'm trying to find a solution to this, possibly by breaking up the large file into smaller files

Wednesday, January 8, 2014

Changes 1/8/14 (Updated after reading DTD)

NSF:

  • Worked more on scraper
  • The tags I have found so far (> indicates hierarchy)
  • Publication date - document-id>document-date
  • title - title-of-invention
  • abstract - subdoc-abstract
  • inventors - inventors
  • CROSS-REFERENCE TO RELATED APPLICATION - subdoc-description>cross-reference-to-related-applications
  • application number - domestic-filing-data>application-number>doc-number
  • US filing date - domestic-filing-data>filing-date
  • Note, not all patent applications will have the following fields - only if they have a corresponding international patent application will they have these fields: Different in DTD
    • PCT filing date (this is the international application filing date) - foreign-priority-data>filing-date
    • PCT patent application number - foreign-priority-data>priority-application-number>doc-number
    • 371(c)(1),(2),(4) Date - ?
    • PCT Publication number - ?
    • PCT Publication date - ?
  • Prior Publication Data - Not in DTD
  • Related Patent documents - cross-reference-to-related-applications>paragraph
  • Government Interest - subdoc-description>federal-research-statement>paragraph-federal-research-statement
     Parent Case - continuity-data>division-of

Changes 12/20/13

NSF:

  • I'm not sure which tags contain certain data:
  • The PCT tags.  I found:
    <priority-application-number>
    <doc-number>PCT/US99/12540</doc-number>
    </priority-application-number>
    <filing-date>19990604</filing-date>
    I'm not sure if this is the right tag, but that area is the only mention of PCT in the xml file.   
  • I couldn't find any mention of Prior Publication Data, I'm not sure if this is a required field or not.  All the applications had the tag
    <publication-filing-type>new</publication-filing-type>
    I don't know if this means that there isn't a Prior Publication Data field. 
  • Related Patent Documents - I already had the field Cross Reference to Related Application, I couldn't find any other tags that looked like they might be talking about related patents
  •  

Tuesday, January 7, 2014

Changes 1/7/14

NSF:

  • Added parent case to tag list (I think it's <continuity-data><division-of>...)

Monday, January 6, 2014

Changes 1/6/14

Math Drill:

  • Fixed some minor UI problems
  • Pushed all changes to git