Corentin Chary
							
						 
					 | 
					
						
						
							
						
						8c40a1795c
					 | 
					
						
						
							
							euscan: blacklist art.gnome.org
						
						
						
						
						
						
						
						Signed-off-by: Corentin Chary <corentincj@iksaif.net> 
						
						
					 | 
					
						2011-09-10 08:25:39 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Corentin Chary
							
						 
					 | 
					
						
						
							
						
						9da62b211b
					 | 
					
						
						
							
							euscan: fix some robots.txt issues
						
						
						
						
						
						
						
						- disable checks for ftp
- fail silently
- use einfo and not eerror
Signed-off-by: Corentin Chary <corentincj@iksaif.net> 
						
						
					 | 
					
						2011-09-10 08:23:46 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Corentin Chary
							
						 
					 | 
					
						
						
							
						
						c5af0e1937
					 | 
					
						
						
							
							euscan: don't mix spaces and tabs
						
						
						
						
						
						
						
						Signed-off-by: Corentin Chary <corentincj@iksaif.net> 
						
						
					 | 
					
						2011-09-06 17:35:17 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Corentin Chary
							
						 
					 | 
					
						
						
							
						
						2210b2610d
					 | 
					
						
						
							
							euscan: don't get robots.txt on ftp
						
						
						
						
						
						
						
						Signed-off-by: Corentin Chary <corentincj@iksaif.net> 
						
						
					 | 
					
						2011-09-06 17:34:50 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Corentin Chary
							
						 
					 | 
					
						
						
							
						
						a137ef60e3
					 | 
					
						
						
							
							euscan: respect robots.txt
						
						
						
						
						
						
						
						Signed-off-by: Corentin Chary <corentincj@iksaif.net> 
						
						
					 | 
					
						2011-09-06 16:32:29 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Corentin Chary
							
						 
					 | 
					
						
						
							
						
						bd75e1af4e
					 | 
					
						
						
							
							euscan/helpers: use HEAD in tryurl
						
						
						
						
						
						
						
						Signed-off-by: Corentin Chary <corentincj@iksaif.net> 
						
						
					 | 
					
						2011-09-06 15:47:54 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Corentin Chary
							
						 
					 | 
					
						
						
							
						
						454d369ced
					 | 
					
						
						
							
							euscan/handlers: fix resursive brute force in generic handler
						
						
						
						
						
						
						
						component was modified by the function since it's a reference,
do an explicit copy to fix that.
Signed-off-by: Corentin Chary <corentincj@iksaif.net> 
						
						
					 | 
					
						2011-09-06 15:47:00 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Corentin Chary
							
						 
					 | 
					
						
						
							
						
						8dc19b9856
					 | 
					
						
						
							
							euscan: fix some errors
						
						
						
						
						
						
						
						Signed-off-by: Corentin Chary <corentincj@iksaif.net> 
						
						
					 | 
					
						2011-09-06 09:17:08 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Corentin Chary
							
						 
					 | 
					
						
						
							
						
						752fb04425
					 | 
					
						
						
							
							euscan: shake the code
						
						
						
						
						
						
						
						- add custom site handlers
- use a custom user agent
- fix some bugs in management commands
Signed-off-by: Corentin Chary <corentincj@iksaif.net> 
						
						
					 | 
					
						2011-08-31 15:38:32 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
							
							
						
					 |