Changes between Initial Version and Version 1 of BadContent

Timestamp:
01/29/10 11:38:09 (14 years ago)
Author:
mrst (IP: 84.202.200.48)
Comment:

--

  • BadContent

{{{
tamiflu
Aloha
## taken from http://ferret.davebalmain.com/trac/wiki/BadContent
## domains
(?i)1webspace\.biz
(?i)frocdjjd\.com
(?i)dellxryq\.com
(?i)cybele\.colorado\.edu
(?i)(chezmoi|home|hometown|homepages)\.aol\.
(?i)(ehome|ourworld)\.compuserve\.
(?i)site\.voila\.fr
(?i)membres\.lycos\.fr
(?i)solia
(?i)dnes.de/
(?i)homeftp.org/
(?i)ddo.jp/
(?i)lima-city.de/
(?i)ourworld-top.cs.com/
(?i)libero.it/
(?i)alice.it/
(?i)awardspace.info/
(?i)wezha.org
(?i)34www.com/
(?i)boftec.com.cn/
(?i)bian.in/
(?i)iol.it/
(?i)lycos.nl/
(?i)rapa.jp/
(?i)blogshot.nl/
(?i)blogspot.com/
(?i)www\.cnfly\.hk
(?i)prize2007\.freewebsites\.com/online-prize.html
(?i)lottery01\.9999mb\.com/arizona-lottery.html
(?i)betting\.b0x\.com/las-vegas-sports-betting.html
(?i)bravehost\.com
(?i)http://[._a-z0-9]*\.gmail\.com/
(?i)http://rubyforge\.org/tracker/download\.php/382/1527/8981
(?i)http://.*tracker/download\.php/
(?i)www.makegamegold.com
(?i)accountcoin.com
(?i)asphost4free.com
(?i)associates-program.com
(?i)hhgwj.cn
(?i)dibangykq.cn
(?i)wood-product.com.cn
(?i)iron-dvb.com.cn
(?i)firstdrugstorezone.info
(?i)wroughtironhouse.cn
(?i)drugssx.com
(?i)mobile-shop.kiev.ua
(?i)drugs-xs.com
## this is a bit radical, but looks like it is the only choice for now
## disabling all http:// urls which are in trac-style
## we might ban all http:// urls if it isn't enough
##(?i)\[\s*http://.*\]\s*\[
## a URL at the start of the ticket should be uncommon for real tickets
^http://
## nasty, useless spam
(?i)(nice|good|great|Best) site
^[;:]\)$
^\?\?\?\?
!!!!$
Good design!
^[hH]i! http://
wow-gold
world-of-warcraft-gold
ionolsen
credit card

## obvious HTML markup
##(?i)style\s*=\s*["' ]?display:
##(?i)\[url=http://
##(?i)<a href=

## text-patterns
LED
(?i)Viagra
(?i)phentermine
(?i)Hydrocodone
(?i)Propecia
(?i)Cialis
(?i)Adipex
(?i)Tramadol
(?i)Ringtone
(?i)japanese
(?i)porn
(?i)free\s+mp3
(?i)mp3\s+music
(?i)teen
(?i)sex
(?i)penis
(?i)Gambl(e|ing)
(?i)varicose
(?i)mortgage
(?i)refinance
(?i)discount
(?i)tamiflu
(?i)airbed
(?i)\srape\s
(?i)\swrestling\s
}}}
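The following is a minimal sketch of how a pattern list like the one above could be applied to submitted text. It is not the spam filter's actual implementation. It assumes that each non-blank line inside the block is one regular expression, that lines starting with `##` are comments, and that a pattern counts as a hit if it is found anywhere in the text. The names `SAMPLE_PAGE`, `load_patterns` and `matching_patterns` are hypothetical, and the sample uses only a few entries copied from the list.

```python
import re

# A short excerpt of the page, kept as a raw string so backslashes survive.
SAMPLE_PAGE = r"""
{{{
## domains
(?i)bravehost\.com
(?i)dnes.de/
## a URL at the start of the ticket should be uncommon for real tickets
^http://
(?i)free\s+mp3
!!!!$
}}}
"""


def load_patterns(page_text):
    """Compile one regex per non-blank, non-comment line of the block."""
    patterns = []
    for line in page_text.splitlines():
        line = line.strip()
        # Assumption: block delimiters, blank lines and '##' comments are skipped.
        if not line or line in ("{{{", "}}}") or line.startswith("##"):
            continue
        patterns.append(re.compile(line))
    return patterns


def matching_patterns(patterns, text):
    """Return the source of every pattern found anywhere in the text."""
    return [p.pattern for p in patterns if p.search(text)]


PATTERNS = load_patterns(SAMPLE_PAGE)

if __name__ == "__main__":
    for sample in ("http://example.org is great",
                   "See http://example.org",
                   "Get FREE   MP3 here"):
        print(repr(sample), "->", matching_patterns(PATTERNS, sample))
```

Under these assumptions `^http://` only fires when the text begins with a URL, so a link in the middle of a sentence passes. Entries with unescaped dots, such as `(?i)dnes.de/`, are looser than they look: `.` matches any character, so `dnesxde/` would also be caught. Escaping the dots, as the first entries in the list do, would tighten the match.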