Commit message (Collapse) | Author | Age | Lines | |
---|---|---|---|---|
* | Handle environment without CPU limitation. | Petter Reinholdtsen | 2016-04-06 | -1/+6 |
| | | | | | | Make sure oep scraper do not exit right away when there is no CPU limit, instead assume a default 10 second limit in this case. 10 seconds is choosed fairly randomly to limit the runtime to a few minutes. | |||
* | Update the setup instructions and add a title. | Petter Reinholdtsen | 2016-04-05 | -6/+26 |
| | ||||
* | Typo. | Petter Reinholdtsen | 2016-03-26 | -1/+1 |
| | ||||
* | Fix OEP scraper. | Petter Reinholdtsen | 2016-03-26 | -1/+8 |
| | | | | | Get OEP scraper working again after the source return 500 Internal Server Error for non-existing entries. | |||
* | Start on new scraiper for sio.no. | Petter Reinholdtsen | 2015-11-23 | -0/+30 |
| | ||||
* | Document how to run a scraper. | Petter Reinholdtsen | 2015-05-23 | -0/+5 |
| | ||||
* | Ignore IDEA and data folders | Anders Einar Hilden | 2015-01-22 | -0/+2 |
| | ||||
* | Almost ready to use [DMS2002] | Anders Einar Hilden | 2015-01-22 | -31/+90 |
| | ||||
* | Add debug info. | Petter Reinholdtsen | 2015-01-20 | -0/+1 |
| | ||||
* | Rewrite logic to select dates, to parse both forward and back in time. | Petter Reinholdtsen | 2015-01-20 | -6/+23 |
| | ||||
* | Add new source. | Petter Reinholdtsen | 2015-01-19 | -0/+1 |
| | ||||
* | New county scraper. | Petter Reinholdtsen | 2015-01-19 | -0/+93 |
| | ||||
* | New sources. | Petter Reinholdtsen | 2015-01-18 | -0/+2 |
| | ||||
* | Document missing fields. | Petter Reinholdtsen | 2015-01-18 | -0/+1 |
| | ||||
* | More info on common fields. | Petter Reinholdtsen | 2015-01-18 | -0/+32 |
| | ||||
* | We can extract all but 3 elements | Anders Einar Hilden | 2015-01-18 | -37/+178 |
| | ||||
* | Starting to rewrite datafinder-dode | Anders Einar Hilden | 2015-01-18 | -30/+65 |
| | ||||
* | Add meta-info. | Petter Reinholdtsen | 2015-01-18 | -2/+29 |
| | ||||
* | Cleanup. | Petter Reinholdtsen | 2015-01-17 | -2/+12 |
| | ||||
* | Get it limping along again. | Petter Reinholdtsen | 2015-01-17 | -1/+11 |
| | ||||
* | Document missing field. | Petter Reinholdtsen | 2015-01-17 | -0/+1 |
| | ||||
* | Fix parser. | Petter Reinholdtsen | 2015-01-17 | -3/+3 |
| | ||||
* | Make git ignore .pyc-files, and add a keepalive-file to the data folder | Anders Einar Hilden | 2015-01-17 | -0/+1 |
| | ||||
* | Add the correct libraryfile for dms2002 | Anders Einar Hilden | 2015-01-17 | -0/+217 |
| | ||||
* | Add scraper library for DMS2002 - Software Innovation. Currently a separate ↵ | Anders Einar Hilden | 2015-01-16 | -0/+41 |
| | | | | library, but might be merged with postliste-python-lib in the future | |||
* | Quiet down. | Petter Reinholdtsen | 2015-01-16 | -3/+3 |
| | ||||
* | First draft for Bergen kommune. | Petter Reinholdtsen | 2015-01-16 | -0/+155 |
| | ||||
* | Add run&test-instructions | Anders Einar Hilden | 2015-01-16 | -0/+9 |
| | ||||
* | Start on README. | Petter Reinholdtsen | 2015-01-16 | -0/+8 |
| | ||||
* | Another batch of strange entries. | Petter Reinholdtsen | 2015-01-13 | -1/+1 |
| | ||||
* | Add test script for elasticsearch. | Petter Reinholdtsen | 2015-01-13 | -0/+105 |
| | ||||
* | Reduce noice and do not log passwords. | Petter Reinholdtsen | 2015-01-04 | -1/+1 |
| | ||||
* | Scan a bit more, and stay 100 000 behind the current pointer. | Petter Reinholdtsen | 2015-01-04 | -2/+2 |
| | ||||
* | Make sure errors are reported. | Petter Reinholdtsen | 2015-01-04 | -1/+0 |
| | ||||
* | Make scraper more robust. | Petter Reinholdtsen | 2015-01-04 | -6/+8 |
| | ||||
* | Ny scraper. | Petter Reinholdtsen | 2015-01-04 | -0/+1 |
| | ||||
* | Improve scraper. | Petter Reinholdtsen | 2015-01-04 | -1/+4 |
| | ||||
* | New scraper for Nordreisa kommune. | Petter Reinholdtsen | 2015-01-04 | -0/+90 |
| | ||||
* | Improve message. | Petter Reinholdtsen | 2014-12-29 | -1/+2 |
| | ||||
* | Do not crash on non-existing URLs. | Petter Reinholdtsen | 2014-12-29 | -1/+3 |
| | ||||
* | Accept problematic pages. | Petter Reinholdtsen | 2014-12-28 | -2/+2 |
| | ||||
* | Disable debugging. | Petter Reinholdtsen | 2014-12-28 | -1/+1 |
| | ||||
* | Add meta info. | Petter Reinholdtsen | 2014-12-28 | -1/+9 |
| | ||||
* | Reduce the days scanning backwards. | Petter Reinholdtsen | 2014-12-27 | -1/+1 |
| | ||||
* | Check a larger time period to handle vacations. | Petter Reinholdtsen | 2014-12-27 | -2/+2 |
| | ||||
* | Flush stdout too. | Petter Reinholdtsen | 2014-12-22 | -0/+2 |
| | ||||
* | Improve output. | Petter Reinholdtsen | 2014-12-22 | -1/+2 |
| | ||||
* | More compact output. | Petter Reinholdtsen | 2014-12-22 | -3/+3 |
| | ||||
* | Done reparsing strange entries. | Petter Reinholdtsen | 2014-12-21 | -1/+1 |
| | ||||
* | Allow scraper to use more CPU. | Petter Reinholdtsen | 2014-12-21 | -1/+1 |
| |