aboutsummaryrefslogtreecommitdiffstats
Commit message (Collapse)AuthorAgeLines
...
* Remember charmap handling.Petter Reinholdtsen2014-12-18-1/+1
|
* HTML scraper is working, save its result.Petter Reinholdtsen2014-12-18-3/+3
|
* Add vendor.Petter Reinholdtsen2014-12-18-1/+1
|
* Map doctype.Petter Reinholdtsen2014-12-18-0/+4
|
* Fix typo.Petter Reinholdtsen2014-12-18-2/+2
|
* More typos.Petter Reinholdtsen2014-12-18-2/+2
|
* Typo.Petter Reinholdtsen2014-12-18-1/+1
|
* First version of the HTML parser.Petter Reinholdtsen2014-12-18-2/+143
|
* New scraper.Petter Reinholdtsen2014-12-18-0/+1
|
* New scraper for the University of Tromsø. Not yet complete.Petter Reinholdtsen2014-12-17-0/+95
|
* Add meta-info.Petter Reinholdtsen2014-12-17-1/+12
|
* Get scraper working again.Petter Reinholdtsen2014-12-14-8/+8
|
* Disable missing pages.Petter Reinholdtsen2014-12-14-6/+6
|
* Typo.Petter Reinholdtsen2014-12-14-2/+2
|
* Correct URLs.Petter Reinholdtsen2014-12-14-2/+13
|
* Get script working with local SQlite files.Petter Reinholdtsen2014-12-13-31/+74
|
* Handle unlimited CPU quota.Petter Reinholdtsen2014-12-13-2/+2
|
* Make scraper more robust.Petter Reinholdtsen2014-12-13-6/+8
|
* Mer meta-info.Petter Reinholdtsen2014-12-13-2/+10
|
* Add duration.Petter Reinholdtsen2014-12-10-0/+1
|
* Quiet down URL extracter.Petter Reinholdtsen2014-12-10-2/+2
|
* Well known ordering when reparsing.Petter Reinholdtsen2014-12-10-1/+1
|
* Might as well flush the buffer when touching the database to remove an entry.Petter Reinholdtsen2014-12-10-0/+3
|
* Typo.Petter Reinholdtsen2014-12-10-1/+1
|
* Add meta info.Petter Reinholdtsen2014-12-10-1/+11
|
* Add forgotten scraper.Petter Reinholdtsen2014-12-10-0/+111
|
* Start on new scraper.Petter Reinholdtsen2014-12-10-0/+164
|
* Add code to try again to load some broken entries in the database.Petter Reinholdtsen2014-12-10-1/+21
| | | | Increase the amount fetched in from 3000 to 5000 the rescan code.
* Make sure unicode string is used for non-ascii strings.Petter Reinholdtsen2014-12-10-1/+1
|
* More meta-info.Petter Reinholdtsen2014-12-10-2/+21
|
* Fix rescan.Petter Reinholdtsen2014-12-09-8/+9
|
* Quiet down and remove unused code.Petter Reinholdtsen2014-12-09-7/+3
|
* Add code to rescan old IDs in case they changed.Petter Reinholdtsen2014-12-09-0/+9
|
* Reduce backward parsing now that we have the complete collection.Petter Reinholdtsen2014-12-09-4/+4
|
* Handle latest entries.Petter Reinholdtsen2014-12-09-3/+11
|
* Get working in new environment and make more robust.Petter Reinholdtsen2014-12-09-6/+19
|
* Add meta-info about some scrapers.Petter Reinholdtsen2014-12-08-3/+30
|
* Get runner working.Petter Reinholdtsen2014-12-08-3/+5
|
* Add meta-info about some scrapers.Petter Reinholdtsen2014-12-08-2/+11
|
* Add a few extra scrapers.Petter Reinholdtsen2014-12-08-0/+260
|
* Add duration.Petter Reinholdtsen2014-12-08-0/+1
|
* Add meta-info about some scrapers.Petter Reinholdtsen2014-12-08-2/+16
|
* New scraper.Petter Reinholdtsen2014-12-08-0/+1
|
* Add meta-info about some scrapers.Petter Reinholdtsen2014-12-08-5/+49
|
* Add meta-info about some scrapers.Petter Reinholdtsen2014-12-08-2/+49
|
* Adjust for new location.Petter Reinholdtsen2014-12-07-3/+3
|
* More details.Petter Reinholdtsen2014-12-07-4/+6
|
* New URL.Petter Reinholdtsen2014-12-07-2/+3
|
* Add meta-info about some scrapers.Petter Reinholdtsen2014-12-07-2/+19
|
* Add meta-info about some scrapers.Petter Reinholdtsen2014-12-07-6/+53
|