40 Commits

Author SHA1 Message Date
Corentin Chary
8d91237988 euscan/handlers: rework handlers, better remote-id + watch support
Signed-off-by: Corentin Chary <corentin.chary@gmail.com>
2012-08-03 21:50:54 +02:00
volpino
d262b935e6 euscan: hopefully better CPAN gentoo version mangling
Signed-off-by: volpino <fox91@anche.no>
2012-07-31 09:50:11 +02:00
volpino
c49914af23 euscan: fixed handler name with remote-id
Signed-off-by: volpino <fox91@anche.no>
2012-07-28 18:16:16 +02:00
volpino
326658acb9 euscan: confidence tweak in generic handler
Signed-off-by: volpino <fox91@anche.no>
2012-07-27 11:26:54 +02:00
volpino
35603fd704 euscan: Added remote-id handler
Signed-off-by: volpino <fox91@anche.no>
2012-07-26 17:46:53 +02:00
volpino
1835327d5c euscan: Fixed typo
Signed-off-by: volpino <fox91@anche.no>
2012-07-26 14:50:47 +02:00
volpino
3ef800b68d Merge branch 'master' of https://github.com/iksaif/euscan
Conflicts:
	bin/euscan_patch_metadata
	euscanwww/djeuscan/views.py
	pym/euscan/handlers/__init__.py
	pym/euscan/handlers/kde.py
	pym/euscan/handlers/url/cpan.py
	pym/euscan/handlers/url/generic.py
	pym/euscan/handlers/url/github.py
	pym/euscan/handlers/url/php.py
	pym/euscan/handlers/url/pypi.py
	pym/euscan/handlers/url/rubygem.py
	pym/euscan/scan.py

Signed-off-by: volpino <fox91@anche.no>
2012-07-26 10:57:38 +02:00
volpino
12bf1fc75b euscan: Redesigning the handlers layout
Signed-off-by: volpino <fox91@anche.no>
2012-07-26 10:44:10 +02:00
volpino
112313ab7f euscan: generic handler fixes
* urljoin was broken with dirs ("http://site.com/lol" joined with "wat"
  produces http://site.com/wat)
* fixed _v regex, don't match what is not needed

Signed-off-by: volpino <fox91@anche.no>
2012-07-25 16:53:17 +02:00
volpino
9f164ed22c euscanwww: Removing useless stuff while creating watch tag
* Removing useless options or actions
* Fixing handling of substitution regex

Signed-off-by: volpino <fox91@anche.no>
2012-07-25 16:53:17 +02:00
volpino
e2890c6364 euscan: Added watch handler, fixed generic one
Signed-off-by: volpino <fox91@anche.no>
2012-07-25 16:53:17 +02:00
volpino
1cf5d0726a euscan: handlers now use a Package object instead of cpv
Signed-off-by: volpino <fox91@anche.no>
2012-07-25 16:53:16 +02:00
volpino
21c6a9107d euscan: generic handler fixes
* urljoin was broken with dirs ("http://site.com/lol" joined with "wat"
  produces http://site.com/wat)
* fixed _v regex, don't match what is not needed

Signed-off-by: volpino <fox91@anche.no>
2012-07-25 10:44:15 +02:00
volpino
29101f88f8 euscanwww: Removing useless stuff while creating watch tag
* Removing useless options or actions
* Fixing handling of substitution regex

Signed-off-by: volpino <fox91@anche.no>
2012-07-25 10:19:27 +02:00
volpino
c35065e344 euscan: Added watch handler, fixed generic one
Signed-off-by: volpino <fox91@anche.no>
2012-07-24 17:48:20 +02:00
volpino
8e08b6a33b euscan: handlers now use a Package object instead of cpv
Signed-off-by: volpino <fox91@anche.no>
2012-07-24 08:56:00 +02:00
volpino
d7bf130fda euscan: minor fixes
Signed-off-by: volpino <fox91@anche.no>
2012-06-28 12:43:46 +02:00
volpino
b06965f386 euscan: plugin system for handlers
Signed-off-by: volpino <fox91@anche.no>
2012-06-28 11:08:05 +02:00
Michał Górny
a3fe7a21e7 github: return the regular download URL instead.
2012-06-14 12:56:45 +02:00
Michał Górny
7cbe517a70 Initial support for github downloads.
Right now, it just handles ebuilds using mirror://github.
2012-06-14 12:34:39 +02:00
Tim Harder
5df28b028e add support for Beautiful Soup 4
2012-06-08 00:18:59 -07:00
volpino
1efeee4134 euscan: added simple support to result confidence
Confidence values are meaningless at the moment. Need to fix it.

Signed-off-by: volpino <fox91@anche.no>
2012-05-23 16:44:44 +02:00
volpino
a18083bd98 euscan: json format output
Now "-f json" seems good, it supports the handler type used to retrieve
each version and outputs metadata.

Signed-off-by: volpino <fox91@anche.no>
2012-05-23 16:30:43 +02:00
volpino
373fba6e01 euscan: fixed patched versions issue
The old code extracted version 0.8.6.1 from buildbot-0.8.6p1 instead of
0.8.6_p1

Signed-off-by: volpino <fox91@anche.no>
2012-05-21 12:03:14 +02:00
volpino
8c91855a58 Lovely day for PEP8 and pylint!
2012-04-28 18:16:05 +02:00
Corentin Chary
6e0706d638 euscan: fix cpan version ordering
Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2012-04-23 17:05:23 +02:00
Corentin Chary
b0ccdd2157 euscan: new cpan handler
Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2012-04-20 17:27:16 +02:00
Corentin Chary
3da5fb5240 euscan: mangle versions the gentoo-way
Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2012-04-19 15:36:44 +02:00
Corentin Chary
76a1121c9f euscan: don't keep version from partial paths
Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2012-03-28 09:09:08 +02:00
Corentin Chary
329b20fecf euscan/rubygem: better gem detection
Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2012-03-02 13:54:23 +01:00
Corentin Chary
df1d778365 euscan: add a quick /_rc/-rc/ hack
Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2012-02-20 08:21:07 +01:00
Corentin Chary
72e43ac4fe euscan: tweak error paths
Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2012-01-02 21:31:05 +01:00
Corentin Chary
b78e73038d euscan: add kde handler and fix some regex issues
Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2011-12-30 11:34:39 +01:00
Corentin Chary
fef31c514c euscan: respect --quiet in pypi handler
Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2011-12-12 15:15:00 +01:00
Corentin Chary
3569f1bb97 euscan: better --quiet mode
Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2011-12-11 14:32:38 +01:00
Corentin Chary
14971584af euscan: robots.txt, timeout, user-agent, ...
- Add a blacklist for robots.txt, we *want* to scan sourceforge
- Set a user-agent that doesn't look like a browser
- Handle timeouts more carefully
- If brute force detects too many versions, avoid infinite loops
- Handle redirections more carefully

Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2011-09-21 10:09:50 +02:00
Corentin Chary
a137ef60e3 euscan: respect robots.txt
Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2011-09-06 16:32:29 +02:00
Corentin Chary
454d369ced euscan/handlers: fix recursive brute force in generic handler
component was modified by the function since it's a reference,
do an explicit copy to fix that.

Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2011-09-06 15:47:00 +02:00
Corentin Chary
8dc19b9856 euscan: fix some errors
Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2011-09-06 09:17:08 +02:00
Corentin Chary
752fb04425 euscan: shake the code
- add custom site handlers
- use a custom user agent
- fix some bugs in management commands

Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2011-08-31 15:38:32 +02:00