What are developers talking about? AN ANALYSIS OF TOPICS AND TRENDS IN STACK OVERFLOW DENNIS PORTENGEN.


1 What are developers talking about? AN ANALYSIS OF TOPICS AND TRENDS IN STACK OVERFLOW DENNIS PORTENGEN

2 Authors: Anton Barua (pursuing MSc. Computing Science), Stephen W. Thomas (PhD Computing Science), Dr. Ahmed E. Hassan (Business)

3 Goal of the paper: “Uncovering the main discussion topics, their underlying dependencies, and trends over time.” (Barua et al., 2012) Four research questions (RQs): (1) What are the main discussion topics? (2) Does a question in one topic trigger answers in another? (3) How does developer interest change over time? (4) How does the interest in specific technologies change over time?

4 Main topics in article. Topic modelling: uses word frequencies and co-occurrence frequencies to build a model of related words. LDA (Latent Dirichlet Allocation): a statistical technique that derives topics, each a set of related words, from the words in a collection of documents. Simple idea: if ‘planet’, ‘space’, ‘star’ and ‘orbit’ occur together, the topic is probably astronomy.
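The word-frequency and co-occurrence idea behind topic modelling can be sketched in a few lines of Python. This is only a minimal illustration, not LDA itself and not the authors' implementation: the toy corpus, the function names, and the `min_count` cut-off are all assumptions made for the example.

```python
from collections import Counter
from itertools import combinations

# Toy corpus (hypothetical): each document is a list of tokens.
docs = [
    ["planet", "space", "star", "orbit"],
    ["star", "orbit", "planet", "telescope"],
    ["socket", "api", "compile", "error"],
    ["compile", "error", "socket", "header"],
]

def word_frequencies(documents):
    """Count how often each word occurs across all documents."""
    return Counter(word for doc in documents for word in doc)

def cooccurrence_counts(documents):
    """Count in how many documents each unordered pair of distinct words appears together."""
    pairs = Counter()
    for doc in documents:
        # sorted(set(...)) gives each pair once per document, in a stable order.
        for a, b in combinations(sorted(set(doc)), 2):
            pairs[(a, b)] += 1
    return pairs

def related_words(word, pairs, min_count=2):
    """Words that share at least `min_count` documents with `word` (assumed cut-off)."""
    related = set()
    for (a, b), n in pairs.items():
        if n >= min_count:
            if a == word:
                related.add(b)
            elif b == word:
                related.add(a)
    return sorted(related)

freq = word_frequencies(docs)
cooc = cooccurrence_counts(docs)

print(related_words("star", cooc))    # ['orbit', 'planet']
print(related_words("socket", cooc))  # ['compile', 'error']
```

Words that keep appearing together end up grouped, which is the intuition behind the astronomy example. LDA goes further by treating each document as a probabilistic mixture of such word groups.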

5 Research Methodology

6 PDD

7 Example: result of pre-processing
Before pre-processing: I’ve been having issues getting C sockets API to work properly in C++. Specifically, although I am including sys/socket.h, I still get compile time errors telling me that AF_INET is not defined. Am I missing something obvious, or could this be related to the fact that I’m doing this coding on z/OS and my problems are much more complicated?
After pre-processing: Issu c socket api work properly c++ specif include sy socket.h compil time error af_inet defin miss obvious relat fact code z os problem complic
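A rough sketch of this kind of pre-processing (lowercasing, stop-word removal, stemming) is shown below. It is an assumption-laden illustration: the stop-word list is tiny and hypothetical, and `crude_stem` is a simple suffix stripper standing in for a real stemmer, so its output will not match the slide's example exactly.

```python
import re

# Tiny illustrative stop-word list (an assumption; real lists are far longer).
STOP_WORDS = {"i", "am", "the", "to", "in", "that", "is", "not", "me", "a", "of", "but"}

# Crude suffix list standing in for a real stemming algorithm (assumption).
SUFFIXES = ("ing", "ed", "es", "s")

def crude_stem(token):
    """Strip the first matching suffix, keeping at least three characters."""
    for suffix in SUFFIXES:
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            return token[: -len(suffix)]
    return token

def preprocess(text):
    """Lowercase, tokenize, drop stop words, and stem the remaining tokens."""
    # Keep letters, digits, '_', '+' and '.' so tokens like 'c++' and 'socket.h' survive.
    raw = re.findall(r"[a-z0-9_+.]+", text.lower())
    # Remove sentence-final periods that stuck to a token.
    tokens = [t.strip(".") for t in raw]
    return [crude_stem(t) for t in tokens if t and t not in STOP_WORDS]

print(preprocess("I am including sys/socket.h, but AF_INET is not defined."))
# ['includ', 'sys', 'socket.h', 'af_inet', 'defin']
```

The point of such a step is to shrink the vocabulary so that variants like "defined" and "defining" count as the same word before any topic model is fitted.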

8 Example output of LDA

9 Related literature, categorized in 4 fields: (1) the general study of Q&A websites; (2) the study of Stack Overflow specifically; (3) the study of other social platforms for developers; (4) the use of LDA to study trends in software engineering data. Difference with these studies: this paper is aimed at the textual content generated by users instead of user activity.

10 Opinion. STRONG POINTS: qualitative and quantitative techniques; large dataset; methodology applicable to other developer resources. WEAK POINTS: methodology does not incorporate a predictive model; experimentation with the K value and the value of threshold δ.

11 Question time!
