Statistical analysis of the reports archive
|
2016-06-06, 03:26
(This post was last modified: 2016-06-07 12:42 by Tingle.)
Post: #1
|
|||||||
|
|||||||
Statistical analysis of the reports archive
I thought it would be interesting to analyse the reports from the archive section to see what hidden gems can be found from the data... Some of it's surprising, some of it is nearly on par with my predictions before conducting the analysis and the rest may just be coincidental!
2963 reports were analysed from 3rd June 2016 to the 3rd October 2011, though in some of the below graphs June 2016 and/or the entirety of 2016 and/or the entirety of 2011 were left out so as to not distort the overall picture (as only partial data was available for 2011 and we're only half way through 2016). I've indicated where this has had an effect. You will find some ex-TC names in the data, I chose not to remove them for simplicity. Spoilers have been used so you can choose what you want to see. Relationship between the probability of an admin taking action and the number of posts in the thread Spoiler (Click to View) Admins most/least likely to take action when dealing with a report Spoiler (Click to View) Proportion of reports resulting in action being taken (overall) Spoiler (Click to View) Top 10 reporters by volume Spoiler (Click to View) Most and least successful reporters (top/bottom 10) of other users Spoiler (Click to View) Top 10 admins to respond to reports by volume Spoiler (Click to View) Top 10 admins to respond to reports by speed Spoiler (Click to View) Monthly/yearly average of report handling times (e.g. the time it takes for a report to be archived from the day it's created) Spoiler (Click to View) Yearly Report Figures Spoiler (Click to View) Testing monthly report numbers for normality Spoiler (Click to View) This thread wouldn't be complete without a couple of word clouds and so I've made word clouds for the reporting/original posts of the ~2900 report threads as well as a word cloud for the final post by the admin. Word clouds are a 'visual representation of text data' with the frequency of words represented by their size. For some reason the R package I used to produce the word cloud has trimmed a few of the letters from the ending of certain words, but it's still understandable. Word cloud of the 'reporting' posts (ie the first post of a report thread outlining the reasons for the report, their username and so on): Spoiler (Click to View) Word cloud of the 'admin decision' posts (ie the last post of a report thread where a decision on the report is made by an admin): Spoiler (Click to View) That's it! I hope at least some parts of this post appealed to you. I scraped the data together using the lite mode of the forum for efficiency (if you can use the words 'efficiency' and 'scrape' in the same sentence), so if you'd like a copy of the dataset I've created do let me know. |
|||||||
|
|||||||
« Next Oldest | Next Newest »
|
Possibly Related Threads... | |||||
Thread: | Author | Replies: | Views: | Last Post | |
My Reports | Tomss | 1 | 1,358 |
2021-03-06 21:28 Last Post: Carl |
|
![]() |
Corby Bug Reports | Pete | 8 | 6,392 |
2016-01-31 09:31 Last Post: Pete |
User(s) browsing this thread: 2 Guest(s)