Changes 4/11
Ruby/Github:
- Merged a branch and dealt with conflicts, this resource was a big help.
- Pushed new code which would remove any newlines in the scraped data when writing to the csv
- I then removed all the newline substitutions that were no longer necessary.
- I also started looking at our regular expression for parsing NSF award numbers from the government interest field. I came up with http://rubular.com/r/0f4YNgZFP1, or:
- (?:(?:(?:\b[a-z]{3})|\b)(?:\d-?){6,8})\b
- The ?: tell regex not to add the arguments in () to a capture group, which was the question I asked Alan, but he didn't know. If you look on the rubular link, you can see why it is the way it is.
Excellent blog posts, Alex! You are learning new things and moving your project forward. I'm looking forward to getting together with you and Kevin next Thursday.
ReplyDelete