Oracle, Where Shall I Submit My Precious Papers? IST Faculty Brown Bag Sep. 22, 2006 Dongwon Lee Sep. 22, 2006 2 Credits Students Ergin Elmacioglu (CSE, Penn State) Su Yan (IST, Penn State) Ziming Zhuang (IST, Penn State) Colleagues Lee Giles (Penn State) Min-Yen Kan (NUS, Singapore) Jaewoo Kang (Korea U., Korea) Divesh Srivastava (AT&T Labs – Research)
17
Embed
Oracle, Where Shall I Submit My Precious Papers?pike.psu.edu/presentations/oracle.pdf · 1 Oracle, Where Shall I Submit My Precious Papers? IST Faculty Brown Bag Sep. 22, 2006 Dongwon
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
1
Oracle, Where Shall I Submit My Precious Papers?
IST Faculty Brown BagSep. 22, 2006
Dongwon Lee
Sep. 22, 2006 2
Credits
StudentsErgin Elmacioglu (CSE, Penn State)Su Yan (IST, Penn State)Ziming Zhuang (IST, Penn State)
ColleaguesLee Giles (Penn State)Min-Yen Kan (NUS, Singapore)Jaewoo Kang (Korea U., Korea)Divesh Srivastava (AT&T Labs – Research)
2
Sep. 22, 2006 3
What do I do?
Databases /Data Mining
Digital Libraries / Info. Retrieval
XML / Web
Sep. 22, 2006 4
What projects do I do?
Databases /Data Mining
Digital Libraries / Info. Retrieval
XML / WebIBM Eclipse, 2004 & 2006Penn State eBRC, 2005
Microsoft SciData 2005
NSF OISE 2006
Today’s Talk
3
Sep. 22, 2006 5
Outline
MotivationSimple StudyResultsSummary
Sep. 22, 2006 6
MIT’s Prankhttp://pdos.csail.mit.edu/scigen/
The World Multi-Conference on Systemics, Cybernetics and Informatics (SCI)
4
Sep. 22, 2006 7
Annoyance…
Sep. 22, 2006 8
“Dong-Won Lee” as PC?
WMSCI2006
5
Sep. 22, 2006 9
Some Known Questionable VenuesFrom http://www.inesc-id.pt/~aml/trash.html:
IMCSE: International Multiconference in Computer Science and Computer Engineering WMSCI or SCI: World Multiconference on Systemics, Cybernetics and Informatics ICCCT: International Conference on Computing, Communications and Control Technologies PISTA: Conference on Politics and Information Systems: Technologies and Applications SSCCII: Symposium of Santa Caterina on Challanges in the Internet and Interdisciplinary Research CITSA: International Conference on Cybernetics and Information Technologies, Systems and Applications ISAS: International Conference on Information Systems Analysis and Synthesis CISCI: Conferencia Iberoamericana en Sistemas, Cibernética e InformáticaSIECI: Simposium Iberoamericano de Educación, Cibernética e InformáticaWCAC: World Congress in Applied Computing Any IPSI International Conference or journal Any GESTS international conference or journal KCPR: International Conference on Knowledge Communication and Peer Reviewing International e-Conference on Computer Science …
http://fakeconferences.org => down from a threat
Sep. 22, 2006 10
Fakes Everywhere
Microsoft HoneyMonkey
6
Sep. 22, 2006 11
Fake VenuesAccording to fakeconferences.org,
“… fake venues are ones that are organized for the revenue, not for the advancement of science…They share a lot in common…an abundance of varying, vaguely connected topics, high frequency of conference, spam mailings, obscure organizers and sponsors, and poor peer reviewing and randomly accepting papers …”
WMSCI has listed close to 300 research topics as relevant in its Call-For-Paper (CFP), and reportedly accepted 2,165 and 2,904 papers in 2003 and 2004, respectively
Sep. 22, 2006 12
Differences in DisciplinesComputer Science
Peer-reviewed conferencesTop conferences have 5-15% acceptance rateSpecialized and small conferences (attendance of 500+)Often value conferences > journals
Pure Sciences (eg, Math, Physics)Pre-print at Arxiv.orgRigorous reviews for journalsHuge flagship conference (ICM 98 attracted ~4000)
Social SciencesOften value journals > conferencesConferences are mostly for gathering or short abstract based screeningRigorous reviews for journals
7
Sep. 22, 2006 13
Outline
MotivationSimple StudyResultsSummary
Sep. 22, 2006 14
Research Question
Can we detect the so called “fake venues” automatically?
DesiderataLarge-number of venues per year scalableAutomatic detection
no human involvementFalse positives >> false negatives
0
100
200
300
400
500
600
700
800
1999 2000 2001 2002 2003 2004 2005 2006
Histogram of CFPs in dbworld
8
Sep. 22, 2006 15
Candidate Features
Good vs. bad venuesCitation counting (eg, Impact Factor)Acceptance rateReputation (eg, society)History…
At the end, none satisfy our desiderata. Need something else…
Sep. 22, 2006 16
Research Hypothesis
PC member list can be readily available from CFP data extraction + data cleaningEach CFP has only finite number of PCs scalabilityExamine quality of PC w.r.t heuristics:
Classification detected two:The 2nd International Advanced Database Conference (IADC)The 4th International Conference on Computer Science and its Applications (ICCSA)
Not part of original Q
14
Sep. 22, 2006 27
PSU PrankApr. 10, 2006, we generated 3 bogus papers using MIT SCIgen software:
P1 by Ethan PatelP2 by Simon R. HathawayP3 by Richard Zhang