Just started self-hosting this instance. Nothing in the docs mentioned anything about storage considerations.

-16 points

Holding onto all that data is pointless if you’re not selling it to someone.

8 points

I disagree. A big chunk of the value of a place like this is being able to look back at old threads. How many times did people say they always put “Reddit” in front of their Google searches to get the information they were looking for? This could be the same.

3 points

That’s a good reason for an instance to put “lemmy” in its url too, I imagine. Search engines are already returning Lemmy results for things.

1 point

That’s unsustainable. Why do you think the mainstream platforms are selling out?

1 point

It’s really not, at least for the text part. Text posts and comments take almost nothing and storage continues to get cheaper.

Mainstream platforms are selling out because they’ve always had owners and shareholders who ultimately want to make money.

2 points

Info is still useful for people doing google searches. It would be nice to be able to find common troubleshooting tips on Lemmy, etc.

1 point

Not everything posted here holds any value.

27 points

Is there any way to purge old data?

35 points

I really hope it doesn’t get purged if lemmy is to be a Reddit replacement. A lot of the value Reddit had was obscure knowledge and making google searches actually usable.

21 points

I think as long as the original community the post is in doesn’t purge the data, it’s fine for other instances to purge if necessary.

3 points

Exactly. When dealing with big data, you need a strategy to archive old data. You can’t just store everything in one DB. Smaller instances may not feel like keeping all the data from all time. Even big instances should have a mechanism to move old data to different databases.
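
A rough sketch of what such an archive step could look like. This is only an illustration: it uses SQLite as a stand-in so it runs anywhere, the `posts` and `posts_archive` tables are hypothetical and not Lemmy's real schema, and the archive lives in the same database for brevity where a real setup would likely point it at a separate one.

```python
import sqlite3

def archive_old_posts(conn, cutoff):
    """Move rows published before `cutoff` (ISO date string) into posts_archive."""
    with conn:  # one transaction: the copy and the delete commit together
        conn.execute(
            "INSERT INTO posts_archive SELECT * FROM posts WHERE published < ?",
            (cutoff,),
        )
        cur = conn.execute("DELETE FROM posts WHERE published < ?", (cutoff,))
    return cur.rowcount  # number of rows removed from the live table

# Hypothetical schema, for illustration only.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE posts (id INTEGER PRIMARY KEY, published TEXT, body TEXT)")
conn.execute("CREATE TABLE posts_archive (id INTEGER PRIMARY KEY, published TEXT, body TEXT)")
conn.executemany(
    "INSERT INTO posts (published, body) VALUES (?, ?)",
    [("2022-01-15", "old thread"), ("2023-06-20", "recent thread")],
)

moved = archive_old_posts(conn, cutoff="2023-01-01")
print(moved)
```

Doing the copy and the delete in one transaction means a crash halfway through should not lose or duplicate rows.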

4 points

Are you planning on donating to instances that don’t purge old data?

257 points

This is lemmy.world after 4 weeks:

58G	pictrs
34G	postgres
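
For scale, those two numbers work out to a bit over 3 GB of growth per day. That assumes the 4 weeks are 28 days, that storage started near zero, and that growth was even, which it almost certainly wasn't. A quick sketch of the arithmetic (the function name is just for illustration):

```python
def daily_growth_gb(sizes_gb, days):
    """Average growth per day, assuming storage started near zero."""
    return sum(sizes_gb.values()) / days

sizes = {"pictrs": 58, "postgres": 34}  # figures from the listing above
rate = daily_growth_gb(sizes, days=28)
print(f"total: {sum(sizes.values())} GB, about {rate:.1f} GB/day")
# prints "total: 92 GB, about 3.3 GB/day"
```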
4 points

Feels like this will benefit from some sort of fuzzy deduplication in the pictrs storage. I bet there are a lot of similar pics in there. E.g. if one pic or a gif is very similar to another, say just different quality or size, or compression, it should keep only one copy. It might already do this for the same files uploaded by different people as those can be compared trivially via hashing, but I doubt it does similarity based deduplication.
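
To make the distinction concrete, here is a minimal sketch of both ideas. It is not a claim about what pictrs actually does. Exact duplicates are caught with a SHA-256 digest; near-duplicates use an average hash, one common perceptual hashing technique. The step that shrinks an image to a small grayscale grid is assumed to have happened already (it would need an image library), so the grids below are made-up pixel values.

```python
import hashlib

def exact_key(data: bytes) -> str:
    """Byte-identical files give identical digests."""
    return hashlib.sha256(data).hexdigest()

def average_hash(pixels):
    """pixels: small 2D grid of grayscale values (e.g. an image shrunk to 8x8).
    Each bit records whether a pixel is brighter than the grid's mean."""
    flat = [p for row in pixels for p in row]
    mean = sum(flat) / len(flat)
    bits = 0
    for p in flat:
        bits = (bits << 1) | (1 if p > mean else 0)
    return bits

def hamming(a: int, b: int) -> int:
    """Number of differing bits; a small distance suggests similar pictures."""
    return bin(a ^ b).count("1")

# Made-up 2x2 grids standing in for downscaled images.
original = [[10, 200], [220, 30]]
recompressed = [[12, 198], [215, 35]]  # same picture, slightly different values
different = [[200, 10], [30, 220]]

print(hamming(average_hash(original), average_hash(recompressed)))  # 0
print(hamming(average_hash(original), average_hash(different)))     # 4
```

A real deduplicator would pick a distance threshold and would still have to decide which copy to keep, which is where it gets risky: two pictures can hash alike and still differ in ways people care about.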


Considering this is going to be around a 5 user instance at most I think I’ll be good for a while. Thanks!

57 points

I’m running 50 users right now, subbed to A LOT of communities, and seeing DB growth of about 100 MB per day.

19 points

That seems high when you extrapolate that to 10000 users, like a larger instance might have.
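
Spelled out, the naive version of that extrapolation looks like the sketch below. It assumes growth scales linearly with user count, which probably overstates things: a federated post is stored once per instance no matter how many local users subscribe. For comparison, the lemmy.world figure elsewhere in this thread (34G of postgres in 4 weeks) is a bit over 1 GB per day.

```python
def extrapolate_mb_per_day(observed_mb, observed_users, target_users):
    """Naive linear scaling: assumes growth is proportional to user count."""
    per_user = observed_mb / observed_users
    return per_user * target_users

mb = extrapolate_mb_per_day(observed_mb=100, observed_users=50, target_users=10_000)
print(f"{mb / 1000:.0f} GB/day")  # 100 / 50 * 10000 = 20000 MB, prints "20 GB/day"
```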

14 points

Question if you know: does a Lemmy instance have to be publicly accessible to work? Like, if I make an instance on my homelab, can the instance “fetch” content and serve it faster locally? Could I reply to a post and have others see it? Etc.

1 point

Wondering this also! Wouldn’t it require a domain for your account though?

17 points

Now I wonder how viable it would be to support video hosting. The answer is almost certainly “God no!”

1 point

It is viable through other hosting services.

11 points

Wow, that is surprisingly not bad given the size of the instance!

16 points

Honestly, less than I thought!

11 points

Yeah lemmy seems to use just about nothing for data storage.

15 points

Interesting, I thought it would be waaayyy more

17 points

At the end of the day the vast majority of what needs to be saved is text. If media content is embedded, the server just has to save the path to the file, not the file itself.

2 points

I haven’t tried it out just yet, but I’d say… a whole lot. It depends on how popular your instance is, but… your PC will be hammered either way.

5 points

How many cans-of-beans.jpg can you store?


At least 3. Maybe 4.


Selfhosted

!selfhosted@lemmy.world

A place to share alternatives to popular online services that can be self-hosted without giving up privacy or locking you into a service you don’t control.

Rules:

  1. Be civil: we’re here to support and learn from one another. Insults won’t be tolerated. Flame wars are frowned upon.

  2. No spam posting.

  3. Posts have to be centered around self-hosting. There are other communities for discussing hardware or home computing. If it’s not obvious why your post topic revolves around selfhosting, please include details to make it clear.

  4. Don’t duplicate the full text of your blog or github here. Just post the link for folks to click.

  5. Submission headline should match the article title (don’t cherry-pick information from the title to fit your agenda).

  6. No trolling.

Resources:

Any issues on the community? Report it using the report flag.

Questions? DM the mods!

Community stats

  • 5.3K

    Monthly active users

  • 3.8K

    Posts

  • 83K

    Comments