
Diff for "LocalBadContent"

Differences between revisions 13 and 50 (spanning 37 versions)
Revision 13 as of 2008-03-11 22:38:51
Size: 978
Comment: remove ambien, allow DomTool/LanguageReference revert to #12
Revision 50 as of 2010-09-11 12:13:49
Size: 6742
Editor: AdamChlipala
Comment:
Line 1: Line 1:
#acl All:read
Line 2: Line 3:
## parts added from http://wiki.python.org/moin, english sorted 2008/03/12
##
## Please add new patterns at the top so others can easily see
## if recent spam patterns have been taken care of.

websearchplanet.info

reasonable price
Reasonable price
Beijing
weight loss

# More untranslated Chinese characters from a particular spam edit that got through

# One particularly persistent bot begins edits with these phrases.
have a nice day
very good work
thank you for your work
please check this
nice to meet you
pleasure
comment1
comment2
comment3
comment4
comment5
comment6
comment7
comment8
comment9
comment0

alba

# Appears when some clueless bots get busy
Edit conflict - other version

# Untranslated Chinese characters that appeared in one particular spam page

# Beijing
# Bright/glorious
# happy
# not
# manufacture

# China
# sales

dudetube
girls
porn
webcam
nude
ephedrine
phendimetrazine
Describe .* here.
ringtone
naked
Turbo
Gold
Runescape
0755-[0-9]{8}
[0-9][0-9]bm\.(net|com)
17-china\.cn
\.76my\.com
(aaaoe|0571ax)\.com
adipex
(alumish|wovens)\.info
(answerbag|createblog)\.com
<a\shref=\shttp://
(asphost4free|gay-guys|bravejournal|clearblogs|hometown\.aol)\.com
azresults\.com
bacobolo\.com
baltimoremaps
(bambulka|infoarena|afxbmx|blog-italy|italy-blogs|italy-forum|bestinternetdirect)\.info
bathjunkie
Line 3: Line 351:
?lusiyao070719
adipex
(bestoplingerie|ebestopsolar|china-sexy-lingerie|chinabestop|fashion-emart|pcqshoes.com|smartsgarment|tsgarment)\.com
bestquery\.net
bestrepl
bfjxkf\.com
blogsitaly
blt\.net\.cn
byyoursidesade
Line 6: Line 359:
crusher
cchello\.com
(cebiz|huochepiao168|korack|zjghtbf)\.cn
cheapwowgold|aygestin|onlinepoker|paxil|nigger|ringtones|xanax|tamiflu
(cheefmsn|iamlonelymsn|vvajcomsn|labicicletta|hutwistina)\.com
china01\.cn/
chinadrtv\.com
coachcartersoundtrack
coolingame
corolla-toyota
cp2y\.com
craps|lottery|poker|casinos|carisoprodol|hoodia|livesexinc
cwb120\.com
cytotec4you
Line 8: Line 373:
(deqinfy|toppowerlevel|dtdns|za\.spamim)\.net
Line 9: Line 375:
e89zmark
Line 10: Line 377:
excellent\ssite
Line 11: Line 379:
feilin\.ha\.cn
foreigntest@163\.com
(fossate|freedoot|uncross|direxit|ithinkso)\.info
(francebfl|lunwendx|rxmedslist|ifrance|nanhuachem)\.com
free-download-firefox\.net
free-downloads
freetesking
freetestking\.org
furnituretalks\.com
gbsh
giftsformotherofthebride
globalchineseedu
google[0-9]+
Google is my favorite search engine!
Line 12: Line 394:
gtqadqadq
guild\ wars\ gold
Line 14: Line 398:
hailianlitong.com
(handbagroom|longhainet|caxa|underwear-wholesaler)\.com
hangzhouhunqing\.com
heritagequest
heromovie
hgiwpiufhl.cn
hndfzx\.com|fengxiong
hollyginger\.com
hostia
howtomakeavatars
Line 15: Line 409:
(hzw1|servemp3|bjbcl|zjyihe)\.com
iamgaolijun\.cn
ibuy-sell\.com
Line 16: Line 413:
index[0-9]{3,}\.html
indexofmovies
i\senjoy\syour\ssite
isk-eve\.com
i\slove\sthis\ssite
johnsonip\.cn
jointroompia\.com
jordex\.net
juliabondvideos
jxpump\.com
kxy\.cn
lesbian-(sex|lovers)
levelmyth
Line 17: Line 427:
levitra|penis|swarsgerdif|hydrocodone|vicodin|(ambien )|(cialis )|( cialis)|wowpowerlevelings
liposome
listsearches\.net
Line 19: Line 432:
love(britneyspears|parishilton)\.net
lusiyao070719
lxep\.com
mapoflosangelescounty
Line 20: Line 437:
no1health\.org
onlinegoldsale\.com
\.(org|com|net)\.cn
(outdoor|dining).*furniture
ovp\.pl
paard
palabrasdeamor
paxu4\.org
porno
portlandoregonzipcode
power *leveling
(preteen|adware)
prostomario
PVD\+coatings
Line 21: Line 452:
rentatent
(rome|it(aly)?)-(software|programming)
rsitem\.com
sanyatravelguide\.com
(schoche|empocket|glozer|unsoncy|pensile|untombed|wickup|gilbart|tattan|elkview)\.info
scoobydoo2monstersunleashed
searchesmonitor\.com
SERVER ERROR
Line 23: Line 462:
(software|programming)-(rome|it(aly)?)
southbeachgirls
superpimper\.com
sup-lotro-gold\.com
(szgwjy|szwanyang|bj5yuehua|datangshutong|ofcpa|tcl)\.com
Line 24: Line 468:
sup-lotro-gold\.com
Line 26: Line 469:
(tandemos|smartice|towerid|samotta)\.net
tilefloorpatterns
topinfosearch\.com
toyota-corolla
[[]url=
usefull2u\.org
utopiaswirve
Line 28: Line 478:
viagra|fuck|diazepam|vicodin|celebrex|asshole|fu-ck|vagina
Line 29: Line 480:
vietnamadoptions
virtuale\.org.*antivirus
volny\.cz
wahlee\.net
Line 30: Line 485:
warcraft
Line 31: Line 487:
world of warcraft
(warcus|mini-freegames|zalasoft)\.com
watch-replicas
windhorsetour\.com
worltopsearch\.net
www\.cnwfyy\.com
www\.hzzxbk\.com
www\.lengque\.cn
www\.qgja\.com
Line 33: Line 496:
# spamwords in traditional and simplified chinese
xbkf120\.cn
x-shockwave-flash
(zhaoad|qualm|ganfushui|banj315|[0-9]+-happy|seo315|daiyun[0-9]+)\.cn
# spam words in traditional and simplified Chinese
# (from Moin master's LocalBadContent)
Line 36: Line 503:
销售
Line 37: Line 505:
销售
# product
Line 41: Line 507:
# product
Line 43: Line 510:
# some spam words from a spam page which was very considerate and
# provided translations of a few Chinese words which should probably
# not occur on python.org under normal circumstances... ;-)
# LOVE

# MAN
男人
# WOMAN
女人
# Heart
Line 45: Line 523:
# and application of the principle of the new
理和应用
# Things to note for immigrants to Canada before departure and after arrival
加拿大移民动身前及入境后的注意事项
# Dehumidifier
除湿机
# Hangzhou New Romantic Life Wedding Co., Ltd.
杭州新浪漫一生婚庆有限公司
# hydraulic machine
液压机
# Industrial design
工业设计
# common words: this, that, department
# but members from China will be blocked if they use them



## This is a honeypot category to discourage autospammers which
## always seem to select the first category...
CategoryAaaBogusBogusBogus
\.cn

websearchplanet.info

reasonable price
Reasonable price
Beijing
weight loss

# More untranslated Chinese characters from a particular spam edit that got through

# One particularly persistent bot begins edits with these phrases.
have a nice day
very good work
thank you for your work
please check this
nice to meet you
pleasure
comment1
comment2
comment3
comment4
comment5
comment6
comment7
comment8
comment9
comment0

alba

# Appears when some clueless bots get busy
Edit conflict - other version

# Untranslated Chinese characters that appeared in one particular spam page

# Beijing
# Bright/glorious
# happy
# not
# manufacture

# China
# sales

dudetube
girls
porn
webcam
nude
ephedrine
phendimetrazine
Describe .* here.
ringtone
naked
Turbo
Gold
Runescape
0755-[0-9]{8}
[0-9][0-9]bm\.(net|com)
17-china\.cn
\.76my\.com
(aaaoe|0571ax)\.com
adipex
(alumish|wovens)\.info
(answerbag|createblog)\.com
<a\shref=\shttp://
(asphost4free|gay-guys|bravejournal|clearblogs|hometown\.aol)\.com
azresults\.com
bacobolo\.com
baltimoremaps
(bambulka|infoarena|afxbmx|blog-italy|italy-blogs|italy-forum|bestinternetdirect)\.info
bathjunkie
\bcialis\W
(bestoplingerie|ebestopsolar|china-sexy-lingerie|chinabestop|fashion-emart|pcqshoes.com|smartsgarment|tsgarment)\.com
bestquery\.net
bestrepl
bfjxkf\.com
blogsitaly
blt\.net\.cn
byyoursidesade
casino
cchello\.com
(cebiz|huochepiao168|korack|zjghtbf)\.cn
cheapwowgold|aygestin|onlinepoker|paxil|nigger|ringtones|xanax|tamiflu
(cheefmsn|iamlonelymsn|vvajcomsn|labicicletta|hutwistina)\.com
china01\.cn/
chinadrtv\.com
coachcartersoundtrack
coolingame
corolla-toyota
cp2y\.com
craps|lottery|poker|casinos|carisoprodol|hoodia|livesexinc
cwb120\.com
cytotec4you
daiyunbb\.cn
(deqinfy|toppowerlevel|dtdns|za\.spamim)\.net
dgsmsj\.com
e89zmark
ephedra
excellent\ssite
eyelash\.net\.cn
feilin\.ha\.cn
foreigntest@163\.com
(fossate|freedoot|uncross|direxit|ithinkso)\.info
(francebfl|lunwendx|rxmedslist|ifrance|nanhuachem)\.com
free-download-firefox\.net
free-downloads
freetesking
freetestking\.org
furnituretalks\.com
gbsh
giftsformotherofthebride
globalchineseedu
google[0-9]+
Google is my favorite search engine!
granitecountertops\.com\.cn
gtqadqadq
guild\ wars\ gold
gzjxhj\.com
HaCKeD_BY_CRueL
hailianlitong.com
(handbagroom|longhainet|caxa|underwear-wholesaler)\.com
hangzhouhunqing\.com
heritagequest
heromovie
hgiwpiufhl.cn
hndfzx\.com|fengxiong
hollyginger\.com
hostia
howtomakeavatars
hydrocodone
(hzw1|servemp3|bjbcl|zjyihe)\.com
iamgaolijun\.cn
ibuy-sell\.com
imageshack\.us
index[0-9]{3,}\.html
indexofmovies
i\senjoy\syour\ssite
isk-eve\.com
i\slove\sthis\ssite
johnsonip\.cn
jointroompia\.com
jordex\.net
juliabondvideos
jxpump\.com
kxy\.cn
lesbian-(sex|lovers)
levelmyth
levitra
levitra|penis|swarsgerdif|hydrocodone|vicodin|(ambien )|(cialis )|( cialis)|wowpowerlevelings
liposome
listsearches\.net
longhainet\.com
Lotro-Power-Leveling-lotro
love(britneyspears|parishilton)\.net
lusiyao070719
lxep\.com
mapoflosangelescounty
mortgage
no1health\.org
onlinegoldsale\.com
\.(org|com|net)\.cn
(outdoor|dining).*furniture
ovp\.pl
paard
palabrasdeamor
paxu4\.org
porno
portlandoregonzipcode
power *leveling
(preteen|adware)
prostomario
PVD\+coatings
refinance
rentatent
(rome|it(aly)?)-(software|programming)
rsitem\.com
sanyatravelguide\.com
(schoche|empocket|glozer|unsoncy|pensile|untombed|wickup|gilbart|tattan|elkview)\.info
scoobydoo2monstersunleashed
searchesmonitor\.com
SERVER ERROR
shachepan\.net
shzgpv\.com
(software|programming)-(rome|it(aly)?)
southbeachgirls
superpimper\.com
sup-lotro-gold\.com
(szgwjy|szwanyang|bj5yuehua|datangshutong|ofcpa|tcl)\.com
szwx\.cn
taiyangzao\.net
(tandemos|smartice|towerid|samotta)\.net
tilefloorpatterns
topinfosearch\.com
toyota-corolla
[[]url=
usefull2u\.org
utopiaswirve
valium
viagra
viagra|fuck|diazepam|vicodin|celebrex|asshole|fu-ck|vagina
vicodin
vietnamadoptions
virtuale\.org.*antivirus
volny\.cz
wahlee\.net
wap.monternet.com
warcraft
warcus\.com
(warcus|mini-freegames|zalasoft)\.com
watch-replicas
windhorsetour\.com
worltopsearch\.net
www\.cnwfyy\.com
www\.hzzxbk\.com
www\.lengque\.cn
www\.qgja\.com
xanax
xbkf120\.cn
x-shockwave-flash
(zhaoad|qualm|ganfushui|banj315|[0-9]+-happy|seo315|daiyun[0-9]+)\.cn
# spam words in traditional and simplified Chinese
# (from Moin master's LocalBadContent)
# sales
銷售
销售
# price
售價
售价
# product
產品
产品
# some spam words from a spam page which was very considerate and
# provided translations of a few Chinese words which should probably
# not occur on python.org under normal circumstances... ;-)
# LOVE
# MAN
男人
# WOMAN
女人
# Heart
# Alopecia, loss of hair
脱发
# common words: this, that, department
# but members from China will be blocked if they use them
## This is a honeypot category to discourage autospammers which
## always seem to select the first category...
CategoryAaaBogusBogusBogus
\.cn
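
A minimal sketch of how a list in this format could be applied, assuming each non-blank line that does not start with `#` is a regular expression searched anywhere in the edited text. The names `load_patterns`, `first_match`, and `SAMPLE` are hypothetical, the matching is case-sensitive by assumption, and this is not the wiki engine's own implementation.

```python
import re


def load_patterns(text):
    """Compile one regex per line, skipping blank lines and '#' comments."""
    patterns = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        patterns.append(re.compile(line))
    return patterns


def first_match(patterns, page_text):
    """Return the source of the first pattern found in page_text, else None."""
    for pattern in patterns:
        if pattern.search(page_text):
            return pattern.pattern
    return None


# A few entries copied from the list above (raw string keeps the backslashes).
SAMPLE = r"""
# phone numbers and domains seen in spam
0755-[0-9]{8}
\.76my\.com
power *leveling
index[0-9]{3,}\.html
"""

patterns = load_patterns(SAMPLE)
print(first_match(patterns, "call 0755-12345678 today"))   # 0755-[0-9]{8}
print(first_match(patterns, "get powerleveling now"))
print(first_match(patterns, "An ordinary edit about type checking"))  # None
```

Because each pattern is searched as a substring, a broad entry such as `\.cn` already covers every narrower entry that ends in `\.cn`, which is why the comments warn that ordinary contributors can be blocked by the most general patterns.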

LocalBadContent (last edited 2013-03-11 17:56:17 by ClintonEbadi)