39 points

"We built a data set of 45 million comments on news articles on the Huffington Post website between January 2013 and February 2015."

I am no expert, but I feel like this is a really bad choice of data set for this study.

12 points

It is. They should’ve used Reddit and Twitter posts/comments from their start to the present to get a more accurate database.

2 points

Or from the start up until like 2016 when the shills and bots started showing up en masse.

2 points

It’s just a bad data set for basically anything.

6 points

Yup, comments on news articles are pure cancer. Comments about news articles can be decent, though they need to be hosted elsewhere.


We built a dataset of three of my comments and found that…
