this post was submitted on 30 Jan 2024
65 points (94.5% liked)

sh.itjust.works Main Community

7718 readers
1 users here now

Home of the sh.itjust.works instance.

Matrix

founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] RightHandOfIkaros@lemmy.world 33 points 9 months ago (3 children)

We built a data set of 45 million comments on news articles on the Huffington Post website between January 2013 and February 2015.

I am no expert but I feel like this is a really bad data set choice for this study.

[–] KuroeNekoDemon@sh.itjust.works 9 points 9 months ago (1 children)

It is. They should've used Reddit and Twitter posts/comments from it's start to the present to get a more accurate database

[–] awwwyissss@lemm.ee 1 points 9 months ago

Or from the start up until like 2016 when the shills and bots started showing up en masse.

[–] allo@sh.itjust.works 1 points 8 months ago

we built a dataset of three of my comments and found that...

[–] MomoTimeToDie@sh.itjust.works 1 points 9 months ago (1 children)

It's just a bad data set for basically anything

[–] sugar_in_your_tea@sh.itjust.works 5 points 9 months ago* (last edited 9 months ago)

Yup, comments on news articles are pure cancer. Comments about news articles can be decent though, but they need to be hosted elsewhere.