Analysis of some newsgroups traffic
by Nasser M. Abbasi, Updated Nov 4, 2012
Data was imported from Google newsgroup. I could not obtain data for R so I do not have it here.
Plots are made for the following groups
group | URL |
fortran | http://groups.google.com/group/comp.lang.fortran/about |
matlab | http://groups.google.com/group/comp.soft-sys.matlab/about |
mathematica | http://groups.google.com/group/comp.soft-sys.math.mathematica/about |
maple | http://groups.google.com/group/comp.soft-sys.math.maple/about |
scilab | http://groups.google.com/group/comp.soft-sys.math.scilab/about |
symbolic | http://groups.google.com/group/sci.math.symbolic/about |
ada | http://groups.google.com/group/comp.lang.ada/about |
python | http://groups.google.com/group/comp.lang.python/about |
lisp | http://groups.google.com/group/comp.lang.lisp/about |
java | http://groups.google.com/group/comp.lang.java.programmer/about? |
ruby | http://groups.google.com/group/comp.lang.ruby/about |
cobol | http://groups.google.com/group/comp.lang.cobol/about |
scheme | http://groups.google.com/group/comp.lang.scheme/about |
sagedevel | http://groups.google.com/group/sage-devel/about |
dsp | http://groups.google.com/group/comp.dsp/about? |
num-analysis | http://groups.google.com/group/sci.math.num-analysis/about? |
tex | http://groups.google.com/group/comp.text.tex/about? |
math | http://groups.google.com/group/sci.math/about? |
physics | http://groups.google.com/group/sci.physics/about? |
javascript | http://groups.google.com/group/comp.lang.javascript/about? |
c++ | http://groups.google.com/group/comp.lang.c++/about? |
c++ moderated | http://groups.google.com/group/comp.lang.c++.moderated/about? |
comp.lang.functional | http://groups.google.com/group/comp.lang.functional/about? |
maxima | http://www.math.utexas.edu/pipermail/maxima/ |
Show trends of each group on its own, y-axis is average number of posts per months, x-axis is month number starting the count from from 1988. Notice some groups are older than others.
Make bar chart showing average posts per month for each group
Show all traffic on one plot
How was the plots generated?
The data above was obtained for each system. Each entry in the list is the number of posts per month. Matlab started in 1993. Fortran in 1988, The Other systems did not start until later. A list plot was generated for each data, comparing systems against each others.
Sage has 2 groups. Support and Developer. It was suggested that only Support be used. Hence in the plots below, only Support is used. Average number of posts per month is also compared between all the systems. Data used is cut off at end of October 2010. Does not include November 2010.
Plots are all shown below.
Function to download the data
Thanks goes to WReach for providing the Mathematica code to help with a problem downloading the data from Google usenet. The post is here.
Thanks also for Mark McClure for providing original function and the parsing for the data.. These made downloading data much easier.
Get all the data into arrays and process