lp://qastaging/~opensearch/opensearch/spider

Created by Vallery Lancey and last modified
Get this branch:
bzr branch lp://qastaging/~opensearch/opensearch/spider
Members of OpenSearch can upload to this branch. Log in for directions.

Branch merges

Related bugs

Related blueprints

Branch information

Owner:
OpenSearch
Project:
OpenSearch
Status:
Experimental

Recent revisions

7. By Vallery Lancey

Made the script update older entier instead of making duplicates. Bug 740637 has more details.

6. By Vallery Lancey

-Protected the script from crashing when getting a 404.
-Made script ignore the #label portion of URLs.
-Hacked protection against an odd crash on line 108 in which the while loop doesn't stop when it should.

5. By Vallery Lancey

Fixed an issue of duplicate URLs in some situations.

4. By Vallery Lancey

Replaced use of wget with urllib2 (abeit with possible odd issues). The script is probably cross-platform compatable now.

3. By Vallery Lancey

Made a few more prints happen only in debug mode.

2. By Vallery Lancey

-Consolidated index.py, links.py and keywords.py into spider.py.
-Added config.py.
-Tempfiles and DB now default to being in ~/.opensearch.

1. By Vallery Lancey

Uploaded a fairly functional search spider.

Branch metadata

Branch format:
Branch format 7
Repository format:
Bazaar repository format 2a (needs bzr 1.16 or later)
This branch contains Public information 
Everyone can see this information.

Subscribers