Friday, April 11, 2014

Changes 4/11

Ruby/Github:

  • Merged a branch and dealt with conflicts, this resource was a big help.
  • Pushed new code which would remove any newlines in the scraped data when writing to the csv
  • I then removed all the newline substitutions that were no longer necessary.
  • I also started looking at our regular expression for parsing NSF award numbers from the government interest field.  I came up with http://rubular.com/r/0f4YNgZFP1, or:
  • (?:(?:(?:\b[a-z]{3})|\b)(?:\d-?){6,8})\b
  • The ?: tell regex not to add the arguments in () to a capture group, which was the question I asked Alan, but he didn't know.  If you look on the rubular link, you can see why it is the way it is.

1 comment:

  1. Excellent blog posts, Alex! You are learning new things and moving your project forward. I'm looking forward to getting together with you and Kevin next Thursday.

    ReplyDelete