I made a Lemmy instance with a custom algorithm that keeps only the top 20% most unique (=interesting?) posts. It does this by calculating a similarity score between every post on my instance and all posts that came before it. The top 80% of posts with the highest self-similarity get removed instantly.

The idea would be that this allows me to cut through the noise that’s running through the communities, similar to how xkcd-signal attempted to do 20 years ago.

The instance is mostly meant for reading, not posting. So it has a very open federation policy (for now).

If anything, this is experimental. So please let me know what you think! You can see the type of stuff that gets removed in the modlog (https://lemmy.coffee/modlog).

You are viewing a single thread.
View all comments
19 points
*

Interesting. One of my instance’s guiding philosophies is “Quality over Quantity”. I’ve taken different steps toward achieving that (defederate from the Reddit repost instances, disallow pretty much all content bots, manually/locally mod duplicate posts, etc).

Do you plan to publish your algorithm/filter? Would be interested in seeing if it could be tuned and possibly reduce some of the workload for me.

permalink
report
reply
8 points

Do you plan to publish your algorithm/filter?

In an ideal world sure. But I’d have to think about that some more, because in principle I don’t want people to game it :)

permalink
report
parent
reply
11 points

Lemmy’s license is AGPL, so you would need to at least publish changes to Lemmy itself 😉

(I don’t know if e.g. the code for the algorithm is separate, in order to have a closed source algorithm with an open source Lemmy fork)

permalink
report
parent
reply
3 points

I made no changes to the lemmy codebase, its all done through an auto-moderating bot that auto-removes posts that don’t meet the standard :)

permalink
report
parent
reply
2 points

Does GPL/AGPL require you to publish the code even if you are not selling the software? As in I could run a library computer with my custom Linux distro without giving anyone the source, but I wouldn’t be able to publish it or sell it only as binary blobs, right?

permalink
report
parent
reply

Fediverse

!fediverse@lemmy.world

Create post

A community to talk about the Fediverse and all it’s related services using ActivityPub (Mastodon, Lemmy, KBin, etc).

If you wanted to get help with moderating your own community then head over to !moderators@lemmy.world!

Rules

  • Posts must be on topic.
  • Be respectful of others.
  • Cite the sources used for graphs and other statistics.
  • Follow the general Lemmy.world rules.

Learn more at these websites: Join The Fediverse Wiki, Fediverse.info, Wikipedia Page, The Federation Info (Stats), FediDB (Stats), Sub Rehab (Reddit Migration)

Community stats

  • 9.8K

    Monthly active users

  • 2.3K

    Posts

  • 84K

    Comments