aboutsummaryrefslogtreecommitdiffstats
Commit message (Expand)AuthorAgeLines
...
* Start on new scraiper for sio.no.Petter Reinholdtsen2015-11-23-0/+30
* Document how to run a scraper.Petter Reinholdtsen2015-05-23-0/+5
* Ignore IDEA and data foldersAnders Einar Hilden2015-01-22-0/+2
* Almost ready to use [DMS2002]Anders Einar Hilden2015-01-22-31/+90
* Add debug info.Petter Reinholdtsen2015-01-20-0/+1
* Rewrite logic to select dates, to parse both forward and back in time.Petter Reinholdtsen2015-01-20-6/+23
* Add new source.Petter Reinholdtsen2015-01-19-0/+1
* New county scraper.Petter Reinholdtsen2015-01-19-0/+93
* New sources.Petter Reinholdtsen2015-01-18-0/+2
* Document missing fields.Petter Reinholdtsen2015-01-18-0/+1
* More info on common fields.Petter Reinholdtsen2015-01-18-0/+32
* We can extract all but 3 elementsAnders Einar Hilden2015-01-18-37/+178
* Starting to rewrite datafinder-dodeAnders Einar Hilden2015-01-18-30/+65
* Add meta-info.Petter Reinholdtsen2015-01-18-2/+29
* Cleanup.Petter Reinholdtsen2015-01-17-2/+12
* Get it limping along again.Petter Reinholdtsen2015-01-17-1/+11
* Document missing field.Petter Reinholdtsen2015-01-17-0/+1
* Fix parser.Petter Reinholdtsen2015-01-17-3/+3
* Make git ignore .pyc-files, and add a keepalive-file to the data folderAnders Einar Hilden2015-01-17-0/+1
* Add the correct libraryfile for dms2002Anders Einar Hilden2015-01-17-0/+217
* Add scraper library for DMS2002 - Software Innovation. Currently a separate l...Anders Einar Hilden2015-01-16-0/+41
* Quiet down.Petter Reinholdtsen2015-01-16-3/+3
* First draft for Bergen kommune.Petter Reinholdtsen2015-01-16-0/+155
* Add run&test-instructionsAnders Einar Hilden2015-01-16-0/+9
* Start on README.Petter Reinholdtsen2015-01-16-0/+8
* Another batch of strange entries.Petter Reinholdtsen2015-01-13-1/+1
* Add test script for elasticsearch.Petter Reinholdtsen2015-01-13-0/+105
* Reduce noice and do not log passwords.Petter Reinholdtsen2015-01-04-1/+1
* Scan a bit more, and stay 100 000 behind the current pointer.Petter Reinholdtsen2015-01-04-2/+2
* Make sure errors are reported.Petter Reinholdtsen2015-01-04-1/+0
* Make scraper more robust.Petter Reinholdtsen2015-01-04-6/+8
* Ny scraper.Petter Reinholdtsen2015-01-04-0/+1
* Improve scraper.Petter Reinholdtsen2015-01-04-1/+4
* New scraper for Nordreisa kommune.Petter Reinholdtsen2015-01-04-0/+90
* Improve message.Petter Reinholdtsen2014-12-29-1/+2
* Do not crash on non-existing URLs.Petter Reinholdtsen2014-12-29-1/+3
* Accept problematic pages.Petter Reinholdtsen2014-12-28-2/+2
* Disable debugging.Petter Reinholdtsen2014-12-28-1/+1
* Add meta info.Petter Reinholdtsen2014-12-28-1/+9
* Reduce the days scanning backwards.Petter Reinholdtsen2014-12-27-1/+1
* Check a larger time period to handle vacations.Petter Reinholdtsen2014-12-27-2/+2
* Flush stdout too.Petter Reinholdtsen2014-12-22-0/+2
* Improve output.Petter Reinholdtsen2014-12-22-1/+2
* More compact output.Petter Reinholdtsen2014-12-22-3/+3
* Done reparsing strange entries.Petter Reinholdtsen2014-12-21-1/+1
* Allow scraper to use more CPU.Petter Reinholdtsen2014-12-21-1/+1
* Update and add meta info.Petter Reinholdtsen2014-12-20-2/+12
* Parse 2013 too, and reorder code.Petter Reinholdtsen2014-12-19-2/+5
* Avoid scraping more pages than we need to.Petter Reinholdtsen2014-12-19-1/+1
* Fix typo in pagination handling.Petter Reinholdtsen2014-12-19-1/+1