Research 101 (or How to Graduate Quickly)? IST 501 Fall 2014 Dongwon Lee, Ph.D.

Slides:



Advertisements
Similar presentations
Choosing a Journal APS Professional Skills Course: Writing and Reviewing for Scientific Journals.
Advertisements

PhD in Databases. introduction why PhD in databases? –everyone should have a reason! which topic should I choose? –usually the supervisor does it for.
CSE594 Fall 2009 Jennifer Wong Oct. 14, 2009
Instructions for PhD Students
Good Research Questions. A paradigm consists of – a set of fundamental theoretical assumptions that the members of the scientific community accept as.
A Template for Producing IT Research and Publication Hsinchun Chen.
On Writing High Quality Technical Papers VLDB Database School (China) Suzhou University, October 3, 2002 Lu Hongjun Hong Kong University of Science & Technology.
CSCD 555 Research Methods for Computer Science
CS 597 Your Ph.D. at USC The goal of a Ph.D. What it takes to achieve a great Ph.D. Courses Advisor How to read papers? How to keep up-to-date with research?
The Erik Jonsson School of Engineering and Computer Science Ph.D. in CS/SE at UTD Balaji Raghavachari Department of Computer Science University of Texas.
Preparing a Successful Graduate Student Award Application Karen Beattie, Ph.D. Assistant Professor Dept. of Medicine McMaster University
How to Create a Research PowerPoint
IMSS005 Computer Science Seminar
Writing a scientific paper Maxine Eskenazi Meeting 1 - Overall Structure and Content of a Paper.
Exploring a topic in depth... From Reading to Writing The drama Antigone was written and performed 2,500 years ago in a society that was very different.
How to do Quality Research for Your Research Paper
Preparing papers for International Journals Sarah Aerni Special Projects Librarian University of Pittsburgh 20 April 2005.
CS4042 / CS4032 – Directed Study 28/01/200925/01/2011 Digital Media Design Music and Performance Technology Jim Buckley CS1-031 (21)3531.
How to get the most out of the survey task + suggested survey topics for CS512 Presented by Nikita Spirin.
How to Research. Research Paper Assignment Identify what the assignment requires:  topic possibilities  number of sources  type of sources (journal,
Michael Arbib: How to Get a Ph.D.January How to Get a Ph.D. 1. Why get a Ph.D.? 2. Finding an Advisor 3. Screening 4. Breadth and Depth 5. What.
Researching & Writing a Literature Review Karen Ciccone NCSU Libraries.
The Erik Jonsson School of Engineering and Computer Science Dissertation and beyond: Ph.D. in CS/SE at UTD Dr. Balaji Raghavachari Department of Computer.
Finding a Dissertation/Thesis Topic Henri Casanova ICS Graduate Chair
Ian F. C. Smith Writing a Conference Paper. 2 Disclaimer This is mostly opinion. Suggestions are incomplete. There are other strategies.
REPORTING AND PUBLISHING RESEARCH FINDINGS Matthew L. S. Gboku DDG/Research Coordinator Sierra Leone Agricultural Research Institute Presentation at the.
Dr. Sundar Christopher Navigating Graduate School and Beyond: Sow Well Now To Reap Big Later Writing Papers.
Abstract  An abstract is a concise summary of a larger project (a thesis, research report, performance, service project, etc.) that concisely describes.
CS & CS ST: Probabilistic Data Management Fall 2016 Xiang Lian Kent State University Kent, OH
NSERC Coach - Dr. Steve Perlman, Dept. of Biology
How to Write a Scientific Paper
AP Computer Science Principals Course Importance and Overview
The AMSc project: what to expect and how to do it
Chapter 5.. From Problems to Sources Manal Badhfer Hawra Al-Zayer
Term Project Proposal By J. H. Wang Apr. 7, 2017.
This Week’s Agenda APA style: -In-text citation -Reference List
AVID Ms. Richardson.
How to Develop and Write a Research Paper.
CSE594 Fall 2009 Jennifer Wong Oct. 14, 2009
Ask students to write on an index card individually
How to write successfully for IATEFL Conference Selections
Experimental Psychology
Literature Surveys Source : : Keshav P. Dahal (Bradford University)
APA Format What you need to know
Introduction to IR Research
Su White Scholarly research ii COMP1205 Su White Date.
Strategies for Preparing for Interviews
Academic Ethics for Students
To present a paper method (technology) how to present it
Session 8 Exam techniques
Creating Research Paper Note Cards
CSC 682: Advanced Computer Security
Peer Reviews Tips for the author.
What to write and how to write it!
Experimental Psychology PSY 433
Academic Communication Lesson 3
How to Read Research Papers?
GRADUATION PROJECT DIPLOMA IN PHARMACY
How do I research effectively? Part 2
CSCD 506 Research Methods for Computer Science
Introduction of the Research Paper
I've Got To Write A Research Paper ! ! !.
Technical Writing Abstract Writing.
Writing Essays.
Unit 3 Introduction.
Conducting a STEM Literature Review
Ask students to write on an index card individually
A Template for Producing IT Research and Publication
Academic Debate and Critical Thinking
CSE594 Fall 2009 Jennifer Wong Oct. 14, 2009
Presentation transcript:

Research 101 (or How to Graduate Quickly)? IST 501 Fall 2014 Dongwon Lee, Ph.D.

IST Justification I am probably a qualified person to give a talk on this topic… because I’m still STRUGGLING to publish Yes, I still do get rejections I’m still learning from failures What’s being presented here is purely my suggestion (+ some other colleagues) Take it or leave it – up to you !!

IST TOC How to start? How to find research problems? How to write research papers? How/Where to submit? Ethics Misc.

What is “Ph.D.”? 4 IST 501

Publish or Perish has to be written first has to validated as novel has to be published To get a good job, have to have many & good papers… IST 501 5

6 The Goal of Research Papers Disseminate your ideas to others so that people appreciate/use/cite them Graduate… Of course MS: need to write a thesis to graduate… Ph.D.: “Publish or Perish” Without good publications… No good job, no good career And possibly no good life either GPA: nobody cares about PhD’s GPA Maintain about 2.0/4.0

IST Where to Start? Given that you have acquired basic theory/knowledge/tools from classes and books… Next, first thing to learn: Read others’ papers Critique and evaluate them Which paper to read? Start from good ones Classical ones Ones from good journals or conferences

IST Where to Start: eg, Databases DB Conferences/Symposiums/Workshops (81) ADB, ADBIS, ADBT, ADC, ARTDB, Berkeley Workshop, BNCOD, CDB, CIDR, CIKM, CISM, CISMOD, COMAD, COODBSE, CoopIS, DAISD, DANTE, DASFAA, DaWaK, DBPL, DBSEC, DDB, DDW, DEXA, DIWeB, DMDW, DMKD, DNIS, DOLAP, DOOD, DPDS, DS, EDBT, EDS, EFIS/EFDBS, ER, EWDW, FODO, FoIKS, FQAS, Future Databases, GIS, HPTS, IADT, ICDE, ICDM, ICDT, ICOD, IDA, IDC(W), IDEAL, IDEAS, IDS, IGIS, IWDM, IW-MMDBMS, JCDKB, KDD, KR, KRDB, LID, MDA/MDM, MFDBS, MLDM, MSS, NLDB, OODBS, OOIS, PAKDD, PKDD, PODS, RIDE, RIDS, RTDB, SBBD, SDM-SIAM, Semantics in Databases, SIGMOD, SSD, SSDBM, SWDB, TDB, TSDM, UIDIS, VDB, VLDB, WebDB, WIDM, WISE, XP, XSym DB Journals (19) ACM TODS, ACM TOIS, DKE, Data Base, DMKD, DPD, IEEE Data Eng. Bulletin, IEEE TKDE, Info. Processing and Management, Info. Processing Letters, Info. Sciences, Info. Systems, J. of Cooperative Info. Systems, J. of Database Management, JIIS, KAIS, SIGKDD Explorations, SIGMOD Record, VLDB J. The list excludes Information Retrieval and Digital Library

IST Where to Start? Some good ones: DB: SIGMOD, VLDB, ICDE, EDBT DB Theory: PODS, ICDT Data Mining: KDD, ICDM, SDM, PKDD Modeling: ER Information Retrieval: SIGIR, CIKM, ECIR Digital Library: JCDL, ECDL, CIKM Web: WWW, WSDM Social: ICWSM, WebSci

IST Reference Chase Don’t trap into the “Exponential Reference Chase” problem References that you use Papers to read in queue

IST Symptoms After chasing relevant works that are increasing super-exponentially fast, you might feel… All relevant problems are ALREADY studied by someone else Others have history: Mathematics, Art, … Problem is too BROAD for me to tackle Divide-n-conquer

IST TOC How to start? How to find research problems? How to write research papers? How/Where to submit? Ethics Misc.

IST Finding DARN Research Problems? Easy but non-helpful answer: Read and think and read and think and… Subjective but MAYBE-helpful answer MAP approach MATRIX approach DELTA approach DROP approach What I Call M2D2

IST MAP Approach To start a research, initially, you have to read a lot of papers anyway While reading those, why don’t you analyze and summarize what you’ve read and put them into your own wording? Good for a survey paper – a MAP for future readers To be publishable, your survey must have novel view-point, taxonomy, comprehensive analysis, or all of them Good target: ACM Comp. Survey, ACM C.ACM, IEEE Computer, JASIST, …

IST MATRIX Approach Now, You have read a lot of papers Draw a MATRIX on a specific problem, and map the paper that you red to cells of matrix At the end, non-filled cell is the missing work that no one has done But wait… first make sure that: The hole is worthwhile to fill in Doable (good as my dissertation topic?) Value (what’s good?)

IST Eg, XML-Relational Conversion Problem Schem a Const raint QueryViewTrigger s Securit y Top-KTempo ral Spatial XML  Relati onal O (40+) O (5+) OOOO Relati onal  XML OOOO Around 2000

IST DELTA Approach Arguably easiest… Pick one paper of your interest Read a lot – more than 10 times Find limitations and Extend it by DELTA Prove or demonstrate that The limitation that you pointed out is valid Your suggestion improved the problem by DELA The more well-known work you choose, the harder to improve, but the better for your reputation… Eg, “E.F. Codd’s relational model is insufficient to handle semi-structured model because…” The bigger the DELTA is, the better your paper gets

IST Eg, The optimal wedding problem When a person has a chance to date K persons, the optimal wedding algorithm is: Date upto K/3 persons Let the best person among K/3 as B using a criteria C Start dating again from K/3+1 person, p If p is better than B using C Stop and Marry p Otherwise, keep dating till K-th person How many ways can we improve this algo?

IST Possible DELTAs Parameters fitting: How to determine K? Estimate? How to determine C? Comparison? Scalability? K=10 vs. K=100,00? Sub-optimal? Question the assumptions: Monogamy vs. Polygamy vs. N-gamy? (How to find n th best spouse fast?) Data distribution? Uniform/Poisson/Scale-free Application to another domain? System building? …

IST Which DELTA to Choose Pick the DELTA that is the most significant Some criteria are: Have practical values Has motivational scenario as of NOW, or Predicted to be useful in N years Non-trivial Hot topics: Streaming, XML, Sensor, …

IST DROP Approach (adopted from J. Widom’s slides) Pick a simple but fundamental assumption of existing theory/model/systems/methods DROP it Reconsider to see how the drop affects all aspects of the existing theory/model/systems/methods Many Ph.D. theses From

IST Eg, Two Stanford DB Projects The LORE Project Dropped assumption: “Data has a fixed schema declared in advance” Semi-structured data (→ XML) Semi-structured data (→ XML) The STREAM Project Dropped assumption: “First load data, then index it, then run queries” Continuous data streams (+ continuous queries) Continuous data streams (+ continuous queries) From

IST TOC How to start? How to find research problems? How to write research papers? How/Where to submit? Ethics Misc.

IST Facts on Paper Reviews (adopted from J. Cho’s slides) 3-4 reviewers per paper 10-20% acceptance rate for top-tier venues Very competitive Criteria Accept/Weak Accept Neutral Weak Reject/Reject One reject kills a paper At least Accept, Weak Accept and Neutral

IST About Reviewers papers per reviewer (for top conferences) Reviewer cannot spend 5-10 hours per paper 20 X 10 = 200 hours = (40 hours X 5) = 5 weeks! No reviewers can afford this Give a good impression in 1-2 hours! Impression matters the most Content comes next! Reviewer do NOT get paid  no motivation to do extra work WARNING: Of course, to start with, your main idea must be good to get into top-tier…

IST Good Impression in 1-2 hours? 1. Good introduction Everyone reads it If not interesting, people stop reading 2. Easy to read People should understand what you say Easy to confuse, difficult to understand 3. Build an excitement and a strong case What is good? 4. Broad reference Sometimes kills a paper Program committee members

Good Introduction Sells IST Excerpt from “How to do good research, get it published” by Eamonn Keogh

Good Introduction Sells IST Excerpt from “How to do good research, get it published” by Eamonn Keogh

Good Introduction Sells IST Excerpt from “How to do good research, get it published” by Eamonn Keogh

IST How to Write an Introduction 1. Start with 5 bullets What’s the problem? Why is it interesting? … sentence answer to each question 3. Add more content 4. Spend enough time on introduction 1. Bullet points enough

IST Easy-to-Read Paper You can always make it complicate later 1. Lots of examples 2. Figures & Tables – Figure speaks !! Summary of notations 3. Define assumptions/models/architecture precisely Explicitly write down assumptions Input, output, property, goal function 4. Make a connection Why this experiment?

IST Paper Organization (10 pages) 1. Introduction (2 pages) 2. Related Work (half page) 3. Framework (2 pages) 4. Main Ideas (3 pages) 5. Experiments (2 pages) 6. Conclusion (half page) 7. References (half page) Actual idea – only 3 pages!!! Even tiny idea can turn into a good paper if you DEVELOP it well

Short Main Idea Watson & Crick’s Nature paper on double helical structure of DNA is only 1 page (+ 1 paragraph) long IST

IST Start Writing Early On… Even if you feel you are NOT ready yet Your advisor will throw away your initial draft anyway Your initial submission will be rejected anyway But you get: (good or bad) Experiences and learn from that Writing sharpens your ideas and gives more ideas Writing can be improved only via writing

IST TOC How to start? How to find research problems? How to write research papers? How/Where to submit? Ethics Misc.

IST Where to Submit? Top-down Aim at the best venue in the field If rejected, go to next-tier venue If rejected, go to next… Bottom-up Aim at workshop If accepted, work more and aim at better one (symposium or 2 nd -tier conference) After making sure that the ideas mature enough, aim at the best conference or journal

IST How to assess venue quality? What venues are reputable? What can be said about questionable ones?

IST Avoid Known Fake Venues From (no longer avail) IMCSE: International Multiconference in Computer Science and Computer Engineering WMSCI or SCI: World Multiconference on Systemics, Cybernetics and Informatics ICCCT: International Conference on Computing, Communications and Control Technologies PISTA: Conference on Politics and Information Systems: Technologies and Applications SSCCII: Symposium of Santa Caterina on Challanges in the Internet and Interdisciplinary Research CITSA: International Conference on Cybernetics and Information Technologies, Systems and Applications ISAS: International Conference on Information Systems Analysis and Synthesis CISCI: Conferencia Iberoamericana en Sistemas, Cibernética e Informática SIECI: Simposium Iberoamericano de Educación, Cibernética e Informática WCAC: World Congress in Applied Computing Any IPSI International Conference or journal Any GESTS international conference or journal KCPR: International Conference on Knowledge Communication and Peer Reviewing International e-Conference on Computer Science …

IST Differences in Disciplines Computer Science Peer-reviewed conferences Top conferences have 5-15% acceptance rate Specialized and small conferences (attendance of 500+) Often value conferences > journals Pure Sciences (eg, Math, Physics) Pre-print at Arxiv.org Rigorous reviews for journals Huge flagship conference (ICM 98 attracted ~4000) Social Sciences Often value journals > conferences Conferences are mostly for gathering or short abstract based screening Rigorous reviews for journals39

My Own Endeavor Oracle, Where Shall I Submit My Papers?, Ergin Elmacioglu, Dongwon Lee, In ACM Comm. of the ACM (CACM), Vol. 52, No. 2, page , February 2009 Measuring Conference Quality by Mining Program Committee Characteristics, Ziming Zhuang, Ergin Elmacioglu, Dongwon Lee, C. Lee Giles, In ACM/IEEE Joint Conf. on Digital Libraries (JCDL), page , Vancouver, BC, Canada, June 2007 IST Studied a data mining technique to detect fake conferences Precision=93%, Recall=96% Used PC (Program Committee) as the main feature

IST Classification with Decision Tree Reputable ? training set PC has feature A? Yes No PC has feature B? PC has feature C?PC has feature D? Reputable venue Disreputable venue p pp pq q qq

IST PSU’s MIT Emulation Apr. 10, 2006, we generated 3 bogus papers using MIT SCIgen software: P1 by Ethan Patel P2 by Simon R. Hathaway P3 by Richard Zhang P1 P2

IST Measuring Paper Authenticity Indiana’s Inauthentic Paper Detector says: P1: 28.9% => inauthentic P2: 61.5% => authentic P3: 38% => inauthentic

IST Results of Our Experiment Conference A and B April 24 – May 1, 2006 P1 was submitted to Conf A on April 24 P2 was submitted to Conf B on April 26 P3 was submitted to Conf A on May 1 May 15, 2006 P1 and P2 accepted w/o reviews P3 rejected w/o reviews Asked for reviews or any rationale  no response

IST “Ethan Patel” made it !

IST TOC How to start? How to find research problems? How to write research papers? How/Where to submit? Ethics Misc.

IST Fabrication From Wikipedia… “Fabrication, in the context of scientific inquiry and academic research, refers to the act of intentionally falsifying research results, such as reported in a journal article. Fabrication is considered a form of scientific misconduct, and is regarded as highly unethical. In some jurisdictions, fabrication may be illegal…”

IST Plagiarism From Wikipedia “… According to Diana Hacker, "Three acts are plagiarism: (1) failing to cite quotations and borrowed ideas, (2) failing to enclose borrowed language in quotation marks and (3) failing to put summaries and paraphrases in your own words…"

IST Eg, Fabrication and Plagiarism “Prominent Physicist Fired for Faking Data Research: Bell Labs says scientist 'recklessly' misrepresented work on microprocessors…” (2002, LA Times) physicist26sep26.story physicist26sep26.story “Constantinos V. Papadopoulos got caught plagiarism at EUROPAR (1995)… 7 papers published and 8 under submission… all plagiarized from Technical Reports…” NEVER, EVER, do these – professional suicide !!

IST TOC How to start? How to find research problems? How to write research papers? How/Where to submit? Ethics Misc.

IST Personal Research Log Maintain personal research log Sketch your research ideas into a writing Update your ideas as time passes Occasionally go back to old writings Prepare a short review for each paper that you read Summary Pros and cons Limitations or problems If needed, contact authors and ask questions Usually authors are willing to discuss with their readers

IST Professional Society Be a member of your professional society early on Ask your advisor to support membership Use the mentor program of societies

IST References (available at) [2002] How to write a paper?, Junghoo Cho, UCLA [1996] David Dill's Advice on Choosing an Advisor (or) How to Survive as a Grad Student, David DillDavid Dill's Advice on Choosing an Advisor (or) How to Survive as a Grad Student [1996] How to Survive as a Graduate Student, Brian Noble, David Dill, Benli Pierce, Jay Sipelstein, Jonathan ShewchuckHow to Survive as a Graduate Student [1997] How to Choose a Thesis Advisor, Michael C. LouiHow to Choose a Thesis Advisor [????] How to have your abstract rejected, Mary-Claire van Leunen and Richard LiptonHow to have your abstract rejected [1994] Dissertation Advice, Olin ShiversDissertation Advice [1999] Advice for Finishing that Damn Ph.D., Daniel M. BerryAdvice for Finishing that Damn Ph.D. [1999] So long, and thanks for the Ph.D.!, Ronald T. AzumaSo long, and thanks for the Ph.D.! [2001] How to Have a Bad Research Career, David A. PattersonHow to Have a Bad Research Career [2012] How to do good research, get it published, Eamonn Keogh, SIAM SDM Tutorial