Adding various patterns to training data #222

Open
wants to merge 2 commits into
base: main
11 changes: 11 additions & 0 deletions measure_performance/test_data/dealstat_tests_v1.xml
@@ -0,0 +1,11 @@
<AddressCollection>
<AddressString><AddressNumber>200</AddressNumber> <StreetNamePreDirectional>EAST</StreetNamePreDirectional> <StreetName>ELM,</StreetName> <PlaceName>DENVER,</PlaceName> <StateName>COLORADO"</StateName></AddressString>
<AddressString><AddressNumber>55</AddressNumber> <StreetName>WINDSOR</StreetName> <StreetNamePostType>PLACE,</StreetNamePostType> <PlaceName>CHAMPAIGN,</PlaceName> <StateName>ILLINOIS"</StateName></AddressString>
<AddressString><AddressNumber>5</AddressNumber> <StreetNamePreDirectional>NORTH</StreetNamePreDirectional> <StreetName>MAIN,</StreetName> <PlaceName>VAN</PlaceName> <PlaceName>NUYS,</PlaceName> <StateName>CALIFORNIA"</StateName></AddressString>
<AddressString><AddressNumber>2609</AddressNumber> <StreetName>BAYVIEW,</StreetName> <PlaceName>FORT</PlaceName> <PlaceName>LAUDERDALE,</PlaceName> <StateName>FL"</StateName></AddressString>
<AddressString><AddressNumber>55</AddressNumber> <StreetName>WINDSOR</StreetName> <StreetNamePostType>PLACE,</StreetNamePostType> <PlaceName>CHAMPAIGN,</PlaceName> <StateName>ILLINOIS"</StateName></AddressString>
<AddressString><AddressNumber>6024</AddressNumber> <StreetName>8TH</StreetName> <StreetNamePostType>ST,</StreetNamePostType> <PlaceName>N.</PlaceName> <PlaceName>MIAMI,</PlaceName> <StateName>FL</StateName> <ZipCode>33144"</ZipCode></AddressString>
<AddressString><AddressNumber>5</AddressNumber> <StreetNamePreDirectional>NORTH</StreetNamePreDirectional> <StreetName>MAIN,</StreetName> <PlaceName>VAN</PlaceName> <PlaceName>NUYS,</PlaceName> <StateName>CALIFORNIA"</StateName></AddressString>
<AddressString><AddressNumber>783</AddressNumber> <StreetName>HOPE</StreetName> <StreetNamePostType>ST,</StreetNamePostType> <PlaceName>PROVIDENCE,</PlaceName> <StateName>RHODE</StateName> <StateName>ISLAND</StateName> <ZipCode>02906"</ZipCode></AddressString>
<AddressString><AddressNumber>200</AddressNumber> <StreetNamePreDirectional>EAST</StreetNamePreDirectional> <StreetName>ELM,</StreetName> <PlaceName>DENVER,</PlaceName> <StateName>COLORADO"</StateName></AddressString>
</AddressCollection>
29 changes: 29 additions & 0 deletions training/dealstat_addresses_v1.xml
@@ -0,0 +1,29 @@
<AddressCollection>
<AddressString><AddressNumber>610</AddressNumber> <StreetNamePreDirectional>EAST</StreetNamePreDirectional> <StreetName>MAIN</StreetName> <PlaceName>MARION</PlaceName> <StateName>KANSAS"</StateName></AddressString>
<AddressString><AddressNumber>10</AddressNumber> <StreetNamePreDirectional>EAST</StreetNamePreDirectional> <StreetName>LAKE,</StreetName> <PlaceName>DENVER,</PlaceName> <StateName>COLORADO"</StateName></AddressString>
<AddressString><AddressNumber>2104</AddressNumber> <StreetName>WINDSOR</StreetName> <StreetNamePostType>PLACE,</StreetNamePostType> <PlaceName>CHAMPAIGN,</PlaceName> <StateName>ILLINOIS"</StateName></AddressString>
<AddressString><AddressNumber>2104</AddressNumber> <StreetName>WINDSOR</StreetName> <StreetNamePostType>PLACE</StreetNamePostType> <PlaceName>CHICAGO</PlaceName> <StateName>ILLINOIS"</StateName></AddressString>
<AddressString><AddressNumber>19</AddressNumber> <StreetName>HARGROVE</StreetName> <StreetNamePostType>GRADE,</StreetNamePostType> <PlaceName>PALM</PlaceName> <PlaceName>COAST</PlaceName> <StateName>FL"</StateName></AddressString>
<AddressString><AddressNumber>61</AddressNumber> <StreetNamePreDirectional>WEST</StreetNamePreDirectional> <StreetName>MAIN,</StreetName> <PlaceName>MARION,</PlaceName> <StateName>KANSAS"</StateName></AddressString>
<AddressString><AddressNumber>55</AddressNumber> <StreetNamePreDirectional>WEST</StreetNamePreDirectional> <StreetName>10TH</StreetName> <PlaceName>DENVER</PlaceName> <StateName>COLORADO"</StateName></AddressString>
<AddressString><AddressNumber>100</AddressNumber> <StreetNamePreDirectional>WEST</StreetNamePreDirectional> <StreetName>SEVENTH,</StreetName> <PlaceName>VAN</PlaceName> <PlaceName>NUYS,</PlaceName> <StateName>CALIFORNIA"</StateName></AddressString>
<AddressString><AddressNumber>225</AddressNumber> <StreetName>RIDGEDALE</StreetName> <StreetNamePostType>AVE,</StreetNamePostType> <PlaceName>N</PlaceName> <PlaceName>HANOVER,</PlaceName> <StateName>NJ</StateName> <ZipCode>07936"</ZipCode></AddressString>
<AddressString><AddressNumber>19</AddressNumber> <StreetName>HARGROVE</StreetName> <StreetNamePostType>GRADE,</StreetNamePostType> <PlaceName>PALM</PlaceName> <PlaceName>COAST</PlaceName> <StateName>FL"</StateName></AddressString>
<AddressString><AddressNumber>610</AddressNumber> <StreetNamePreDirectional>EAST</StreetNamePreDirectional> <StreetName>MAIN</StreetName> <PlaceName>MARION</PlaceName> <StateName>KANSAS"</StateName></AddressString>
<AddressString><AddressNumber>2104</AddressNumber> <StreetName>WINDSOR</StreetName> <StreetNamePostType>PLACE</StreetNamePostType> <PlaceName>CHICAGO</PlaceName> <StateName>ILLINOIS"</StateName></AddressString>
<AddressString><AddressNumber>61</AddressNumber> <StreetNamePreDirectional>WEST</StreetNamePreDirectional> <StreetName>MAIN,</StreetName> <PlaceName>MARION,</PlaceName> <StateName>KANSAS"</StateName></AddressString>
<AddressString><AddressNumber>20</AddressNumber> <StreetName>PARK</StreetName> <StreetNamePostType>STREET,</StreetNamePostType> <PlaceName>JOHNSTON,</PlaceName> <StateName>RHODE</StateName> <StateName>ISLAND</StateName> <ZipCode>02919"</ZipCode></AddressString>
<AddressString><AddressNumber>8</AddressNumber> <StreetName>ISLE</StreetName> <StreetName>OF</StreetName> <StreetName>VENICE,</StreetName> <PlaceName>FORT</PlaceName> <PlaceName>LAUDERDALE,</PlaceName> <StateName>FL</StateName> <ZipCode>33301"</ZipCode></AddressString>
<AddressString><AddressNumber>2735</AddressNumber> <StreetName>PAWTUCKET</StreetName> <StreetNamePostType>AVE</StreetNamePostType> <PlaceName>EAST</PlaceName> <PlaceName>PROVIDENCE</PlaceName> <StateName>RHODE</StateName> <StateName>ISLAND</StateName> <ZipCode>02914"</ZipCode></AddressString>
<AddressString><AddressNumber>977</AddressNumber> <StreetName>PLEASANT</StreetName> <StreetNamePostType>STREET,</StreetNamePostType> <PlaceName>N.</PlaceName> <PlaceName>ORANGE,</PlaceName> <StateName>NJ</StateName> <ZipCode>07052"</ZipCode></AddressString>
<AddressString><AddressNumber>5548</AddressNumber> <StreetName>ELMER</StreetName> <StreetNamePostType>AVENUE,</StreetNamePostType> <PlaceName>S</PlaceName> <PlaceName>HOLLYWOOD,</PlaceName> <StateName>CA</StateName> <ZipCode>91601"</ZipCode></AddressString>
<AddressString><AddressNumber>55</AddressNumber> <StreetNamePreDirectional>WEST</StreetNamePreDirectional> <StreetName>10TH</StreetName> <PlaceName>DENVER</PlaceName> <StateName>COLORADO"</StateName></AddressString>
<AddressString><AddressNumber>225</AddressNumber> <StreetNamePreDirectional>WEST</StreetNamePreDirectional> <StreetName>ELM,</StreetName> <PlaceName>FORT</PlaceName> <PlaceName>LAUDERDALE,</PlaceName> <StateName>FL</StateName> <ZipCode>33301"</ZipCode></AddressString>
<AddressString><AddressNumber>5548</AddressNumber> <StreetName>ELMER</StreetName> <StreetNamePostType>AVENUE,</StreetNamePostType> <PlaceName>N.</PlaceName> <PlaceName>HOLLYWOOD,</PlaceName> <StateName>CA</StateName> <ZipCode>91601"</ZipCode></AddressString>
<AddressString><AddressNumber>100</AddressNumber> <StreetNamePreDirectional>WEST</StreetNamePreDirectional> <StreetName>SEVENTH,</StreetName> <PlaceName>VAN</PlaceName> <PlaceName>NUYS,</PlaceName> <StateName>CALIFORNIA"</StateName></AddressString>
<AddressString><AddressNumber>20</AddressNumber> <StreetName>PARK</StreetName> <StreetNamePostType>STREET,</StreetNamePostType> <PlaceName>JOHNSTON,</PlaceName> <StateName>RHODE</StateName> <StateName>ISLAND"</StateName></AddressString>
<AddressString><AddressNumber>10</AddressNumber> <StreetNamePreDirectional>EAST</StreetNamePreDirectional> <StreetName>LAKE,</StreetName> <PlaceName>DENVER,</PlaceName> <StateName>COLORADO"</StateName></AddressString>
<AddressString><AddressNumber>1600</AddressNumber> <StreetNamePreDirectional>NE</StreetNamePreDirectional> <StreetName>4TH</StreetName> <PlaceName>FORT</PlaceName> <PlaceName>LAUDERDALE,</PlaceName> <StateName>FL"</StateName></AddressString>
<AddressString><AddressNumber>29</AddressNumber> <StreetName>UPLAND</StreetName> <StreetNamePostType>WAY,</StreetNamePostType> <PlaceName>BARRINGTON,</PlaceName> <StateName>RHODE</StateName> <StateName>ISLAND</StateName> <ZipCode>02806"</ZipCode></AddressString>
<AddressString><AddressNumber>2104</AddressNumber> <StreetName>WINDSOR</StreetName> <StreetNamePostType>PLACE,</StreetNamePostType> <PlaceName>CHAMPAIGN,</PlaceName> <StateName>ILLINOIS"</StateName></AddressString>
</AddressCollection>
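
Review note: many tokens in both files end in a double quote (for example `COLORADO"`), and several rows appear more than once. Whether either is intentional is not stated in the PR. A minimal sketch for checking files in this format, assuming each `<AddressString>` holds one labeled token per child element. The helper names (`read_labeled_addresses`, `find_trailing_quotes`, `find_duplicate_rows`) are hypothetical and not part of any library. `SAMPLE` is three rows copied from `dealstat_tests_v1.xml` above.

```python
import xml.etree.ElementTree as ET
from collections import Counter

# Three rows copied verbatim from dealstat_tests_v1.xml in this diff.
SAMPLE = """<AddressCollection>
<AddressString><AddressNumber>200</AddressNumber> <StreetNamePreDirectional>EAST</StreetNamePreDirectional> <StreetName>ELM,</StreetName> <PlaceName>DENVER,</PlaceName> <StateName>COLORADO"</StateName></AddressString>
<AddressString><AddressNumber>55</AddressNumber> <StreetName>WINDSOR</StreetName> <StreetNamePostType>PLACE,</StreetNamePostType> <PlaceName>CHAMPAIGN,</PlaceName> <StateName>ILLINOIS"</StateName></AddressString>
<AddressString><AddressNumber>200</AddressNumber> <StreetNamePreDirectional>EAST</StreetNamePreDirectional> <StreetName>ELM,</StreetName> <PlaceName>DENVER,</PlaceName> <StateName>COLORADO"</StateName></AddressString>
</AddressCollection>"""


def read_labeled_addresses(xml_text):
    """Return one list of (token, label) pairs per <AddressString>."""
    root = ET.fromstring(xml_text)
    return [
        [(child.text or "", child.tag) for child in address]
        for address in root.iter("AddressString")
    ]


def find_trailing_quotes(addresses):
    """Return (row index, token, label) for each token ending in a double quote."""
    return [
        (row, token, label)
        for row, tokens in enumerate(addresses)
        for token, label in tokens
        if token.endswith('"')
    ]


def find_duplicate_rows(addresses):
    """Return the joined token text of each row that occurs more than once."""
    counts = Counter(
        " ".join(token for token, _ in tokens) for tokens in addresses
    )
    return [text for text, n in counts.items() if n > 1]


addresses = read_labeled_addresses(SAMPLE)
print(find_trailing_quotes(addresses))
print(find_duplicate_rows(addresses))
```

To check a whole file, read its contents and pass the text to `read_labeled_addresses` in place of `SAMPLE`. The duplicate check compares token text only, so two rows with the same tokens but different labels would also be reported.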