Commit Graph

71 Commits

Author SHA1 Message Date
Corentin Chary
8190779070 euscan: remove a line
Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2012-02-14 14:54:34 +01:00
Corentin Chary
72e43ac4fe euscan: tweak error paths
Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2012-01-02 21:31:05 +01:00
Corentin Chary
b78e73038d euscan: add kde handler and fix some regex issues
Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2011-12-30 11:34:39 +01:00
Corentin Chary
fef31c514c euscan: respect --quiet in pypi handler
Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2011-12-12 15:15:00 +01:00
Corentin Chary
3569f1bb97 euscan: better --quiet mode
Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2011-12-11 14:32:38 +01:00
Corentin Chary
a11bfdc54f euscan: fix a bug with htop versions
Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2011-11-27 16:25:44 +01:00
Corentin Chary
8be69f2350 euscan: show a trace when blocked by robots.txt
Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2011-10-08 08:33:48 +02:00
Corentin Chary
739358cf7c euscan: really skip all dev versions
Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2011-10-08 08:33:25 +02:00
Corentin Chary
277fb4ebe6 euscan: add new robots.txt exceptions
Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2011-10-08 08:33:03 +02:00
Corentin Chary
d7f655cdde euscan: add an optional persistent cache
Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2011-10-02 10:04:44 +02:00
Corentin Chary
7fb4e6edff euscan: better output
Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2011-09-21 10:58:41 +02:00
Corentin Chary
14971584af euscan: robots.txt, timeout, user-agent, ...
- Add a blacklist for robots.txt, we *want* to scan sourceforge
- Set a user-agent that doesn't look like a browser
- Handle timeouts more carefully
- If brute force detects too many versions, avoid infinite loops
- Handle redirections more carefully

Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2011-09-21 10:09:50 +02:00
Corentin Chary
8c40a1795c euscan: blacklist art.gnome.org
Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2011-09-10 08:25:39 +02:00
Corentin Chary
9da62b211b euscan: fix some robots.txt issues
- disable checks for ftp
- fail silently
- use einfo and not eerror

Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2011-09-10 08:23:46 +02:00
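
The robots.txt behaviour described in the commit above (skip the check for ftp, fail silently) can be sketched roughly as follows. This is an illustration in Python 3 using the standard `urllib.robotparser` module, not the code from the commit; the `allowed_by_robots` function, the `USER_AGENT` value, and passing the robots.txt content in as a list of lines are all assumptions made so the example runs without network access.

```python
from urllib.parse import urlparse
from urllib.robotparser import RobotFileParser

# Hypothetical user agent string, used only for this sketch.
USER_AGENT = "example-scanner"


def allowed_by_robots(url, robots_lines):
    """Return True if `url` may be fetched under the given robots.txt lines."""
    scheme = urlparse(url).scheme
    if scheme not in ("http", "https"):
        # robots.txt is an HTTP convention, so ftp URLs are not checked.
        return True
    try:
        parser = RobotFileParser()
        parser.parse(robots_lines)
        return parser.can_fetch(USER_AGENT, url)
    except Exception:
        # Fail silently: a broken robots.txt should not abort the scan.
        return True


rules = ["User-agent: *", "Disallow: /private/"]
print(allowed_by_robots("http://example.org/private/a.tar.gz", rules))
print(allowed_by_robots("ftp://example.org/private/a.tar.gz", rules))
```

In a real scanner the lines would come from fetching `/robots.txt` on the target host; keeping the parsing separate from the fetching makes the decision logic easy to test.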
Corentin Chary
c5af0e1937 euscan: don't mix spaces and tabs
Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2011-09-06 17:35:17 +02:00
Corentin Chary
2210b2610d euscan: don't get robots.txt on ftp
Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2011-09-06 17:34:50 +02:00
Corentin Chary
a137ef60e3 euscan: respect robots.txt
Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2011-09-06 16:32:29 +02:00
Corentin Chary
bd75e1af4e euscan/helpers: use HEAD in tryurl
Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2011-09-06 15:47:54 +02:00
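
Probing a URL with HEAD instead of GET, as the commit above describes, avoids downloading the body just to learn whether a file exists. A minimal Python 3 sketch using `urllib.request` is shown below. The names `build_head_request` and `url_exists`, the user agent string, and the timeout value are hypothetical and do not come from euscan itself.

```python
import urllib.error
import urllib.request

# Hypothetical defaults for this sketch.
USER_AGENT = "example-scanner"
TIMEOUT_SECONDS = 10


def build_head_request(url, user_agent=USER_AGENT):
    """Build a HEAD request so that only the headers are transferred."""
    return urllib.request.Request(
        url, headers={"User-Agent": user_agent}, method="HEAD"
    )


def url_exists(url):
    """Return True if the server answers the HEAD request without an error."""
    try:
        with urllib.request.urlopen(
            build_head_request(url), timeout=TIMEOUT_SECONDS
        ) as response:
            return 200 <= response.status < 400
    except (urllib.error.URLError, OSError):
        # HTTP errors, unreachable hosts and timeouts all count as "not found".
        return False


request = build_head_request("http://example.org/foo-1.0.tar.gz")
print(request.get_method())
```

Some servers reject or mishandle HEAD, so a real implementation may need a GET fallback; that is left out here.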
Corentin Chary
454d369ced euscan/handlers: fix recursive brute force in generic handler
component was modified by the function because it is passed by reference;
make an explicit copy to fix that.

Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2011-09-06 15:47:00 +02:00
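
The bug class named in the commit above is ordinary Python aliasing: a list passed to a function is the same object the caller holds, so in-place changes leak back out and corrupt later recursion steps. The sketch below shows the problem and the explicit-copy fix. The function names and the version-component list are hypothetical and are not taken from the euscan handler.

```python
def bump_in_place(components, index):
    """Buggy variant: increments one component of the caller's own list."""
    components[index] += 1
    return components


def bump_with_copy(components, index):
    """Fixed variant: works on an explicit copy and leaves the input alone."""
    components = list(components)  # shallow copy is enough for a list of ints
    components[index] += 1
    return components


version = [1, 2, 3]

candidate = bump_with_copy(version, 2)
print(version, candidate)

bump_in_place(version, 2)
print(version)
```

For nested structures, `copy.deepcopy` would be the safer choice; a flat list of integers only needs `list(...)` or slicing.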
Corentin Chary
8dc19b9856 euscan: fix some errors
Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2011-09-06 09:17:08 +02:00
Corentin Chary
752fb04425 euscan: shake the code
- add custom site handlers
- use a custom user agent
- fix some bugs in management commands

Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2011-08-31 15:38:32 +02:00