Friday, January 31, 2014
Changes 1/31/14
I have been extremely sick this week, so I haven't been able to blog or really do anything IT related. I will try to do a bit this weekend (I will try to stay in the Database class since there is stuff due on Monday).
Monday, January 27, 2014
Thursday, January 23, 2014
Changes 1/23/14
NSF:
- Worked on CSV generation (not really sure how to format it)
Databasing:
- Did Relational Algebra stuff over the weekend and started SQL today
HTML/CSS:
- Updated website so it is no longer horrible
Thursday, January 16, 2014
Changes 1/16/14
Database Class:
- Finished DTD Exercises. They were kind of annoying and tedious, but I know more about DTDs than I did before, so I guess it was worth it.
- Watched 1st JSON video
Wednesday, January 15, 2014
Changes 1/15/14
NSF:
- Fixed the parser. It now parses the entire xml file and prints the values found on the console. This is extremely close to intended functionality. The only thing missing at this point is a few tags, which I couldn't find because these documents want a DTD from 2002, but I have the 2012 version. I am going to email Michelle about that tomorrow.
Tuesday, January 14, 2014
Changes 1/14/14
NSF:
- Set up the parsing so that the full xml file is split into smaller segments, with each one containing valid xml and the DTD thing at the top (I don't know what the proper name is)
- Theoretically this should have fixed everything, but instead it finds every value as being none except for <document-date> which is odd.
- I'll look at it tomorrow
Monday, January 13, 2014
Changes 1/13/14
NSF:
- The DTD doesn't reflect the xml file with the PCT fields. In the xml, the PCT is under <foreign-priority-data>, but in the DTD it instead points to <pct-or-regional-filing-data> as containing PCT data. The DTD I'm using is called "DTD US Patent Application Publications v43 2012-12-04."
- The DTD also makes no mention of Prior Publication Data
Databasing:
- Took the first quiz, XML DTDS and Schema, and got a 7 out of 8. I forgot to check whether the IDREF attributes actually had corresponding IDs on one of the questions, so I got that one wrong.
Changes 1/10/14
NSF:
- Since the xml is redefined several times in the patent application document, the parser only seems to be able to parse from within the first <patent-application-publication> element. I'm trying to find a solution to this, possibly by breaking up the large file into smaller files
Wednesday, January 8, 2014
Changes 1/8/14 (Updated after reading DTD)
NSF:
- Worked more on scraper
- The tags I have found so far (> indicates hierarchy)
- Publication date - document-id>document-date
- title - title-of-invention
- abstract - subdoc-abstract
- inventors - inventors
- CROSS-REFERENCE TO RELATED APPLICATION - subdoc-description>cross-reference-to-related-applications
- application number - domestic-filing-data>application-number>doc-number
- US filing date - domestic-filing-data>filing-date
- Note, not all patent applications will have the following fields - only if they have a corresponding international patent application will they have these fields: Different in DTD
- PCT filing date (this is the international application filing date) - foreign-priority-data>filing-date
- PCT patent application number - foreign-priority-data>priority-application-number>doc-number
- 371(c)(1),(2),(4) Date - ?
- PCT Publication number - ?
- PCT Publication date - ?
- Prior Publication Data - Not in DTD
- Related Patent documents - cross-reference-to-related-applications>paragraph
- Government Interest - subdoc-description>federal-research-statement>paragraph-federal-research-statementParent Case - continuity-data>division-of
Changes 12/20/13
NSF:
- I'm not sure which tags contain certain data:
- The PCT tags. I found:
<priority-application-number> <doc-number>PCT/US99/12540</doc-number> </priority-application-number> <filing-date>19990604</filing-date>
I'm not sure if this is the right tag, but that area is the only mention of PCT in the xml file. - I couldn't find any mention of Prior Publication Data, I'm not sure if this is a required field or not. All the applications had the tag
<publication-filing-type>new</publication-filing-type>
I don't know if this means that there isn't a Prior Publication Data field. - Related Patent Documents - I already had the field Cross Reference to Related Application, I couldn't find any other tags that looked like they might be talking about related patents
Tuesday, January 7, 2014
Monday, January 6, 2014
Subscribe to:
Posts (Atom)