Commit message  Author  Age  Lines
...
* Start on new scraper for sio.no.Petter Reinholdtsen2015-11-23-0/+30
|
* Document how to run a scraper.Petter Reinholdtsen2015-05-23-0/+5
|
* Ignore IDEA and data foldersAnders Einar Hilden2015-01-22-0/+2
|
* Almost ready to use [DMS2002]Anders Einar Hilden2015-01-22-31/+90
|
* Add debug info.Petter Reinholdtsen2015-01-20-0/+1
|
* Rewrite logic to select dates, to parse both forward and back in time.Petter Reinholdtsen2015-01-20-6/+23
|
* Add new source.Petter Reinholdtsen2015-01-19-0/+1
|
* New county scraper.Petter Reinholdtsen2015-01-19-0/+93
|
* New sources.Petter Reinholdtsen2015-01-18-0/+2
|
* Document missing fields.Petter Reinholdtsen2015-01-18-0/+1
|
* More info on common fields.Petter Reinholdtsen2015-01-18-0/+32
|
* We can extract all but 3 elementsAnders Einar Hilden2015-01-18-37/+178
|
* Starting to rewrite datafinder-dodeAnders Einar Hilden2015-01-18-30/+65
|
* Add meta-info.Petter Reinholdtsen2015-01-18-2/+29
|
* Cleanup.Petter Reinholdtsen2015-01-17-2/+12
|
* Get it limping along again.Petter Reinholdtsen2015-01-17-1/+11
|
* Document missing field.Petter Reinholdtsen2015-01-17-0/+1
|
* Fix parser.Petter Reinholdtsen2015-01-17-3/+3
|
* Make git ignore .pyc-files, and add a keepalive-file to the data folderAnders Einar Hilden2015-01-17-0/+1
|
* Add the correct libraryfile for dms2002Anders Einar Hilden2015-01-17-0/+217
|
* Add scraper library for DMS2002 - Software Innovation. Currently a separate library, but might be merged with postliste-python-lib in the futureAnders Einar Hilden2015-01-16-0/+41
|
* Quiet down.Petter Reinholdtsen2015-01-16-3/+3
|
* First draft for Bergen kommune.Petter Reinholdtsen2015-01-16-0/+155
|
* Add run&test-instructionsAnders Einar Hilden2015-01-16-0/+9
|
* Start on README.Petter Reinholdtsen2015-01-16-0/+8
|
* Another batch of strange entries.Petter Reinholdtsen2015-01-13-1/+1
|
* Add test script for elasticsearch.Petter Reinholdtsen2015-01-13-0/+105
|
* Reduce noise and do not log passwords.Petter Reinholdtsen2015-01-04-1/+1
|
* Scan a bit more, and stay 100 000 behind the current pointer.Petter Reinholdtsen2015-01-04-2/+2
|
* Make sure errors are reported.Petter Reinholdtsen2015-01-04-1/+0
|
* Make scraper more robust.Petter Reinholdtsen2015-01-04-6/+8
|
* New scraper.Petter Reinholdtsen2015-01-04-0/+1
|
* Improve scraper.Petter Reinholdtsen2015-01-04-1/+4
|
* New scraper for Nordreisa kommune.Petter Reinholdtsen2015-01-04-0/+90
|
* Improve message.Petter Reinholdtsen2014-12-29-1/+2
|
* Do not crash on non-existing URLs.Petter Reinholdtsen2014-12-29-1/+3
|
* Accept problematic pages.Petter Reinholdtsen2014-12-28-2/+2
|
* Disable debugging.Petter Reinholdtsen2014-12-28-1/+1
|
* Add meta info.Petter Reinholdtsen2014-12-28-1/+9
|
* Reduce the days scanning backwards.Petter Reinholdtsen2014-12-27-1/+1
|
* Check a larger time period to handle vacations.Petter Reinholdtsen2014-12-27-2/+2
|
* Flush stdout too.Petter Reinholdtsen2014-12-22-0/+2
|
* Improve output.Petter Reinholdtsen2014-12-22-1/+2
|
* More compact output.Petter Reinholdtsen2014-12-22-3/+3
|
* Done reparsing strange entries.Petter Reinholdtsen2014-12-21-1/+1
|
* Allow scraper to use more CPU.Petter Reinholdtsen2014-12-21-1/+1
|
* Update and add meta info.Petter Reinholdtsen2014-12-20-2/+12
|
* Parse 2013 too, and reorder code.Petter Reinholdtsen2014-12-19-2/+5
|
* Avoid scraping more pages than we need to.Petter Reinholdtsen2014-12-19-1/+1
|
* Fix typo in pagination handling.Petter Reinholdtsen2014-12-19-1/+1
|