Commit Graph

135 Commits

Author SHA1 Message Date
Corentin Chary
28b913295a euscan: ignore robots.txt for fedorahosted.org
Signed-off-by: Corentin Chary <corentin.chary@gmail.com>
2012-07-25 17:04:37 +02:00
volpino
112313ab7f euscan: generic handler fixes
* urljoin was broken with dirs ("http://site.com/lol" joined with "wat"
  produces http://site.com/wat)
* fixed _v regex, don't match what is not needed

Signed-off-by: volpino <fox91@anche.no>
2012-07-25 16:53:17 +02:00
volpino
9f164ed22c euscanwww: Removing useless stuff while creating watch tag
* Removing useless options or actions
* Fixing handling of substitution regex

Signed-off-by: volpino <fox91@anche.no>
2012-07-25 16:53:17 +02:00
volpino
e2890c6364 euscan: Added watch handler, fixed generic one
Signed-off-by: volpino <fox91@anche.no>
2012-07-25 16:53:17 +02:00
volpino
1cf5d0726a euscan: handlers now use a Package object instead of cpv
Signed-off-by: volpino <fox91@anche.no>
2012-07-25 16:53:16 +02:00
volpino
67654ee0b4 euscan: Progressbar on stderr
Signed-off-by: volpino <fox91@anche.no>
2012-07-25 16:53:16 +02:00
volpino
21c6a9107d euscan: generic handler fixes
* urljoin was broken with dirs ("http://site.com/lol" joined with "wat"
  produces http://site.com/wat)
* fixed _v regex, don't match what is not needed

Signed-off-by: volpino <fox91@anche.no>
2012-07-25 10:44:15 +02:00
volpino
29101f88f8 euscanwww: Removing useless stuff while creating watch tag
* Removing useless options or actions
* Fixing handling of substitution regex

Signed-off-by: volpino <fox91@anche.no>
2012-07-25 10:19:27 +02:00
volpino
c35065e344 euscan: Added watch handler, fixed generic one
Signed-off-by: volpino <fox91@anche.no>
2012-07-24 17:48:20 +02:00
volpino
8e08b6a33b euscan: handlers now use a Package object instead of cpv
Signed-off-by: volpino <fox91@anche.no>
2012-07-24 08:56:00 +02:00
volpino
2b09bfba8a euscan: Progressbar on stderr
Signed-off-by: volpino <fox91@anche.no>
2012-07-24 08:10:56 +02:00
volpino
ad02fd4b81 euscanwww: Added ansi to html helper
Signed-off-by: volpino <fox91@anche.no>
2012-07-20 15:12:44 +02:00
Corentin Chary
5fb076b304 euscan: remove trailing comma
Signed-off-by: Corentin Chary <corentin.chary@gmail.com>
2012-07-20 09:06:22 +02:00
Corentin Chary
8743e9d1e1 euscan: move version type helpers to euscan.version
Signed-off-by: Corentin Chary <corentin.chary@gmail.com>
2012-07-20 09:04:15 +02:00
Corentin Chary
634e06b779 euscan: don't fail if robots.txt can't be read
Signed-off-by: Corentin Chary <corentin.chary@gmail.com>
2012-07-20 08:41:13 +02:00
Corentin Chary
bcda1a7ce3 euscan: add --ebuild-uri to use ${PV}, etc.
Signed-off-by: Corentin Chary <corentin.chary@gmail.com>
2012-07-20 08:41:12 +02:00
volpino
be146163f6 euscanwww: Displaying unstable versions in package view
Signed-off-by: volpino <fox91@anche.no>
2012-07-17 13:18:44 +02:00
volpino
33a06d99fb euscan: added --ignore-pre-release and --ignore-pre-release-if-stable
options

Signed-off-by: volpino <fox91@anche.no>
2012-07-17 12:27:22 +02:00
volpino
3f0cef09e0 euscan: Fixed typo in out.py. euscan output was broken
Signed-off-by: volpino <fox91@anche.no>
2012-07-10 17:41:24 +02:00
volpino
02be58024a euscanwww: Minor fixes
Signed-off-by: volpino <fox91@anche.no>
2012-07-09 18:15:34 +02:00
volpino
03cf3b90a4 euscanwww: Upgraded to Celery 3.0 and fixed scan_portage
Fixed TaskSet calls and synchronous stuff. Everything should be
deadlock safe

Signed-off-by: volpino <fox91@anche.no>
2012-07-07 18:09:14 +02:00
volpino
5a2f2e9eab Merge branch 'master' of https://github.com/iksaif/euscan 2012-07-02 17:45:31 +02:00
volpino
6407efa14f euscan: naive implementation of --mirror
Signed-off-by: volpino <fox91@anche.no>
2012-07-02 17:44:55 +02:00
Corentin Chary
90702ddee8 euscan: don't import pkg_resources, we don't want all these warnings
Signed-off-by: Corentin Chary <corentin.chary@gmail.com>
2012-07-02 11:20:52 +02:00
Corentin Chary
728770088c euscan: don't respect robots.txt for mono download site
Signed-off-by: Corentin Chary <corentin.chary@gmail.com>
2012-06-30 15:57:56 +02:00
volpino
fc7f929038 euscanwww: Fixed broken tasks
Signed-off-by: volpino <fox91@anche.no>
2012-06-29 17:25:45 +02:00
volpino
d7bf130fda euscan: minor fixes
Signed-off-by: volpino <fox91@anche.no>
2012-06-28 12:43:46 +02:00
volpino
0d6376681f euscan: fixed progressbar
added -p option, now the progressbar is "global" and shows the total
progress

Signed-off-by: volpino <fox91@anche.no>
2012-06-28 12:20:57 +02:00
volpino
b06965f386 euscan: plugin system for handlers
Signed-off-by: volpino <fox91@anche.no>
2012-06-28 11:08:05 +02:00
volpino
8fdac1014c euscanwww: Adding infos to EuscanResults
Now we store metadata, output from euscan is cleaned from colors
encoding when outputting json

Signed-off-by: volpino <fox91@anche.no>
2012-06-26 18:13:28 +02:00
volpino
d7e0a4ca16 euscanwww: Tweaked processing.scan_portage to use eix --xml
Signed-off-by: volpino <fox91@anche.no>
2012-06-26 17:16:02 +02:00
volpino
9af5548178 Merge branch 'master' of https://github.com/iksaif/euscan 2012-06-22 09:34:11 +02:00
Michał Górny
a3fe7a21e7 github: return the regular download URL instead. 2012-06-14 12:56:45 +02:00
Michał Górny
5af479fea8 Handle exceptions in scanning loop. 2012-06-14 12:38:52 +02:00
Michał Górny
7cbe517a70 Initial support for github downloads.
Right now, it just handles ebuilds using mirror://github.
2012-06-14 12:34:39 +02:00
volpino
6cc892717b Merge branch 'master' of https://github.com/iksaif/euscan 2012-06-14 10:21:37 +02:00
volpino
bd971a7da0 euscanwww: When scanning upstream output object was never cleaned
Signed-off-by: volpino <fox91@anche.no>
2012-06-12 14:55:58 +02:00
volpino
9cddc82808 Merge branch 'master' of https://github.com/iksaif/euscan
Conflicts:
	euscanwww/djeuscan/models.py

Signed-off-by: volpino <fox91@anche.no>
2012-06-08 14:46:37 +02:00
volpino
b43bf8fa28 euscanwww: Removing duplicated code
Signed-off-by: volpino <fox91@anche.no>
2012-06-08 14:43:18 +02:00
Tim Harder
5df28b028e add support for Beautiful Soup 4 2012-06-08 00:18:59 -07:00
volpino
47ec539e1d euscanwww: scan_upstream imports euscan.scan
scan_upstream command imports euscan.scan and simply calls a function,
it doesn't call a subprocess

Signed-off-by: volpino <fox91@anche.no>
2012-05-31 18:42:37 +02:00
Corentin Chary
3437cf5e4b euscan: cleanups
Signed-off-by: Corentin Chary <corentin.chary@gmail.com>
2012-05-28 07:02:41 +02:00
volpino
a224cb27cb euscan: naive progressbar implementation
Progressbar for formatted output

Signed-off-by: volpino <fox91@anche.no>
2012-05-27 10:23:15 +02:00
volpino
31bd4dc55e euscan: adjusted code to be PEP8 compliant
Signed-off-by: volpino <fox91@anche.no>
2012-05-25 10:23:33 +02:00
Corentin Chary
305fdea0c7 euscan: make eend works again
Simplify einfo/ewarn/eend handling (for now) and make "message" someting
containing only what stdout would contain. We loose some informations, but
I'm really not sure we need more.

Signed-off-by: Corentin Chary <corentin.chary@gmail.com>
2012-05-25 00:19:17 +02:00
Corentin Chary
687851ffcb Merge git://github.com/volpino/euscan
Conflicts:
	pym/euscan/scan.py
2012-05-24 23:47:55 +02:00
Corentin Chary
177d77842f euscan: allow to use .ebuild files in any directory
Even outside PORTDIR_OVERLAY !

Signed-off-by: Corentin Chary <corentin.chary@gmail.com>
2012-05-24 23:44:05 +02:00
volpino
4d606ae1cf euscan: added scan time to formatted output
scan_time in metadata dictionary

Signed-off-by: volpino <fox91@anche.no>
2012-05-24 20:27:18 +02:00
volpino
9989433ad5 euscan: detect version type
In the formatted output display the version type (beta, alpha, rc, ...)

Signed-off-by: volpino <fox91@anche.no>
2012-05-23 23:08:07 +02:00
volpino
1efeee4134 euscan: added simple support to result confidence
Confidence values are meaningless at the moment. Need to fix it.

Signed-off-by: volpino <fox91@anche.no>
2012-05-23 16:44:44 +02:00
volpino
a18083bd98 euscan: json format output
Now "-f json" seems good, it supports the handler type used to retrieve
each version and outputs metadata.

Signed-off-by: volpino <fox91@anche.no>
2012-05-23 16:30:43 +02:00
volpino
8cb19b5a6b euscan: adding json output
Naive json output implmented, probably needs some further tuning

Signed-off-by: volpino <fox91@anche.no>
2012-05-21 22:38:38 +02:00
volpino
373fba6e01 euscan: fixed patched versions issue
The old code extracted version 0.8.6.1 from buildbot-0.8.6p1 instead of
0.8.6_p1

Signed-off-by: volpino <fox91@anche.no>
2012-05-21 12:03:14 +02:00
volpino
058b50556b issue #4 2012-05-05 14:28:48 +02:00
volpino
8c91855a58 Lovely day for PEP8 and pylint! 2012-04-28 18:16:05 +02:00
Corentin Chary
6e0706d638 euscan: fix cpan version ordering
Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2012-04-23 17:05:23 +02:00
Corentin Chary
b0ccdd2157 euscan: new cpan handler
Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2012-04-20 17:27:16 +02:00
Corentin Chary
3da5fb5240 euscan: mangle versions the gentoo-way
Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2012-04-19 15:36:44 +02:00
Corentin Chary
6a57b44d7c euscan: force nodejs.org scan
Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2012-04-04 14:34:09 +02:00
Corentin Chary
76a1121c9f euscan: don't keep version from partial paths
Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2012-03-28 09:09:08 +02:00
Corentin Chary
5062f29e3c euscanwww: don't delete packages when deleting related versions
Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2012-03-14 14:07:28 +01:00
Corentin Chary
329b20fecf euscan/rubygem: better gem detection
Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2012-03-02 13:54:23 +01:00
Corentin Chary
df1d778365 euscan: add a quick /_rc/-rc/ hack
Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2012-02-20 08:21:07 +01:00
Corentin Chary
5bd358968a euscan: re-indent blacklists
Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2012-02-20 08:20:54 +01:00
Corentin Chary
8190779070 euscan: remove a line
Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2012-02-14 14:54:34 +01:00
Corentin Chary
72e43ac4fe euscan: tweak error paths
Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2012-01-02 21:31:05 +01:00
Corentin Chary
b78e73038d euscan: add kde handler and fix some regex issues
Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2011-12-30 11:34:39 +01:00
Corentin Chary
fef31c514c euscan: respect --quiet in pypi handler
Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2011-12-12 15:15:00 +01:00
Corentin Chary
3569f1bb97 euscan: better --quiet mode
Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2011-12-11 14:32:38 +01:00
Corentin Chary
a11bfdc54f euscan: fix a bug with htop versions
Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2011-11-27 16:25:44 +01:00
Corentin Chary
8be69f2350 euscan: show a trace when blocked by robots.txt
Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2011-10-08 08:33:48 +02:00
Corentin Chary
739358cf7c euscan: really skip all dev versions
Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2011-10-08 08:33:25 +02:00
Corentin Chary
277fb4ebe6 euscan: add new robots.txt exceptions
Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2011-10-08 08:33:03 +02:00
Corentin Chary
d7f655cdde euscan: add an optional persistent cache
Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2011-10-02 10:04:44 +02:00
Corentin Chary
7fb4e6edff euscan: better output
Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2011-09-21 10:58:41 +02:00
Corentin Chary
14971584af euscan: robots.txt, timeout, user-agent, ...
- Add a blacklist for robots.txt, we *want* to scan sourceforge
- Set a user-agent that doesn't looks like a browser
- Handle timeouts more carefully
- If brute force detect too much versions, avoid infinite loops
- Handle redirections more carefully

Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2011-09-21 10:09:50 +02:00
Corentin Chary
8c40a1795c euscan: blacklist art.gnome.org
Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2011-09-10 08:25:39 +02:00
Corentin Chary
9da62b211b euscan: fix some robots.txt issues
- disable checks for ftp
- fail silently
- use einfo and not eerror

Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2011-09-10 08:23:46 +02:00
Corentin Chary
c5af0e1937 euscan: don't mix spaces and tabs
Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2011-09-06 17:35:17 +02:00
Corentin Chary
2210b2610d euscan: don't get robots.txt on ftp
Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2011-09-06 17:34:50 +02:00
Corentin Chary
a137ef60e3 euscan: respect robots.txt
Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2011-09-06 16:32:29 +02:00
Corentin Chary
bd75e1af4e euscan/helpers: use HEAD in tryurl
Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2011-09-06 15:47:54 +02:00
Corentin Chary
454d369ced euscan/handlers: fix resursive brute force in generic handler
component was modified by the function since it's a reference,
do an explicit copy to fix that.

Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2011-09-06 15:47:00 +02:00
Corentin Chary
8dc19b9856 euscan: fix some errors
Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2011-09-06 09:17:08 +02:00
Corentin Chary
752fb04425 euscan: shake the code
- add custom site handlers
- use a custom user agent
- fix some bugs in management commands

Signed-off-by: Corentin Chary <corentincj@iksaif.net>
2011-08-31 15:38:32 +02:00