Just started self hosting this instance. Nothing on the docs mentioned anything about storage considerations.

  • Ruud@lemmy.worldM
    link
    fedilink
    English
    arrow-up
    50
    arrow-down
    1
    ·
    1 year ago

    This is lemmy.world after 4 weeks:

    58G	pictrs
    34G	postgres
    
      • manitcor@lemmy.intai.tech
        link
        fedilink
        English
        arrow-up
        10
        ·
        edit-2
        1 year ago

        im running 50 users right now, subbed to A LOT of communities, seeing db growth of about 100mb per day.

          • terribleplan@lemmy.nrd.li
            link
            fedilink
            English
            arrow-up
            12
            ·
            edit-2
            1 year ago

            It’s all about how many communities your user(s) subscribe to since your instance basically acts as a mirror for those.

            My instance has been running for 23 days, and I am pretty much the only active local user:

            7.3G    pictrs
            5.3G    postgres
            

            edit: I may have a slight Reddit Lemmy problem

            • Pleonasm@programming.dev
              link
              fedilink
              English
              arrow-up
              2
              ·
              1 year ago

              So if you’re the only user (let’s assume for ease) then, that represents all the updates (posts, comments, votes) from each community that you are subscribed to?

              • terribleplan@lemmy.nrd.li
                link
                fedilink
                English
                arrow-up
                5
                ·
                1 year ago

                Yeah, and I purposely subscribe to (or sometimes have a dedicated “federation helper bot” account I run subscribe to) most of the most popular communities on the most popular instances so I can get a decent sampling of what’s going on in the fediverse on the “All” feed. So I assume my storage usage is maybe a bit higher than what an “average” single-user instance may be…

                • ipkpjersi@lemmy.one
                  link
                  fedilink
                  English
                  arrow-up
                  1
                  ·
                  1 year ago

                  Ooh, that’s a really good idea, I need a federation helper bot/account when I start self-hosting a Lemmy instance!

      • Dran@lemmy.world
        link
        fedilink
        English
        arrow-up
        8
        ·
        1 year ago

        Question if you know: does a lemmy instance have to be publically accessable to work? Like, if I make an instance on my homelab can the instance “fetch” content and serve it faster locally? Could I reply to a post and have others see it? Etc

    • HappyHam@lemm.ee
      link
      fedilink
      English
      arrow-up
      6
      ·
      1 year ago

      Now I wonder how viable it would be to support video hosting. The answer is almost certainly “God no!”

    • lightrush@lemmy.ca
      link
      fedilink
      English
      arrow-up
      3
      arrow-down
      1
      ·
      edit-2
      1 year ago

      Feels like this will benefit from some sort of fuzzy deduplication in the pictrs storage. I bet there are a lot of similar pics in there. E.g. if one pic or a gif is very similar to another, say just different quality or size, or compression, it should keep only one copy. It might already do this for the same files uploaded by different people as those can be compared trivially via hashing, but I doubt it does similarity based deduplication.

      • BigWigglyStyle@lemmynsfw.com
        link
        fedilink
        English
        arrow-up
        1
        ·
        1 year ago

        At the end of the day the vast majority of what needs to be saved is text. If media content is embedded, the the server just has to save the path to the file not the file itself.

  • Kayn@dormi.zone
    link
    fedilink
    English
    arrow-up
    7
    ·
    edit-2
    1 year ago

    My instance dormi.zone has been running for around 3½ weeks now, has a 3-digit amount of users and hosts a community with little more than 1000 subscribers. Here’s how much storage it currently takes up:

    • 6.2 GiB postgres
    • 4.9 GiB pictrs

    In the default Ansible configuration, storage will mostly be accumulated by log files that are automatically generated by Docker and deleted whenever you restart the Docker containers.

    • holycrap@lemmy.world
      link
      fedilink
      English
      arrow-up
      5
      ·
      1 year ago

      I really hope it doesn’t get purged if lemmy is to be a Reddit replacement. A lot of the value Reddit had was obscure knowledge and making google searches actually usable.

      • Molecular0079@lemmy.world
        link
        fedilink
        English
        arrow-up
        6
        ·
        1 year ago

        I think as long as the original community the post is in doesn’t purge the data, it’s fine for other instances to purge if necessary.

        • punio7@lemmy.world
          link
          fedilink
          English
          arrow-up
          1
          ·
          1 year ago

          Exactly, when dealing with big data, you need a strategy to archive old data. You can’t just store everything in one DB. Smaller instances may not feel like keeping all the date from all the time. Even big instances should have a mechanism to move old data do different databases.

      • Kayn@dormi.zone
        link
        fedilink
        English
        arrow-up
        3
        ·
        1 year ago

        Are you planning on donating to instances that don’t purge old data?

  • Tugg@lemmyverse.org
    link
    fedilink
    English
    arrow-up
    2
    ·
    1 year ago

    Small instance with about 3 users and myself online for about 2 weeks.

    pictrs   930M
    postgres 1.4G
    
  • key@lemmy.keychat.org
    link
    fedilink
    English
    arrow-up
    1
    ·
    edit-2
    1 year ago

    Depends. If you have a lot of users posting a lot of pictures and you use pictrs out of the box config, then a lot. If you are just running a few users with finite communities being synced then a lot less. The number is going to vary a lot as lemmy grows and gets older so hard to document realistic expectations. But docker images are probably going to take up more disk space than actual contents unless you get quite big. I just threw my PG volume into a tgz to move servers and it’s less than a gig.

    • bdonvr@thelemmy.club
      link
      fedilink
      English
      arrow-up
      1
      ·
      edit-2
      1 year ago

      The lemmy.world admin said above that their instance currently takes up less than 100GB

      Though this will accrue over time I suppose.

      But for self hosting? You should be good for a long long time. The only pictures stored are the ones you upload, the rest is just text.

  • JoShmoe@lemmy.zip
    link
    fedilink
    English
    arrow-up
    1
    arrow-down
    6
    ·
    1 year ago

    Holding onto all that data is pointless if you’re not selling it to someone.

    • AFK BRB Chocolate@lemmy.world
      link
      fedilink
      English
      arrow-up
      2
      ·
      1 year ago

      I disagree. One big hunk of value of a place like this is being able to look back at old threads. How many times did people say they always put “Reddit” in front of their Google searches to get the information they were looking for? This could be the same.

      • NaN@lemmy.blahaj.zone
        link
        fedilink
        English
        arrow-up
        1
        ·
        1 year ago

        That’s a good reason for an instance to put “lemmy” in its url too, I imagine. Search engines are already returning Lemmy results for things.