Conversation
Notices
Phantasm (phnt@fluffytail.org)'s status on Tuesday, 30-Sep-2025 02:56:31 JST
Phantasm
fedi-pleroma-maintenance-brain.webp
Doughnut Lollipop 【記録係】:blobfoxgooglymlem:, snacks and Johnny Peligro like this.
Phantasm (phnt@fluffytail.org)'s status on Tuesday, 30-Sep-2025 05:17:11 JST
Phantasm
@lain Verita is running that task daily and switches Postgres in and out of replication mode on the fly, because that is supposed to make it go quicker.
Johnny Peligro likes this.
lainy (lain@lain.com)'s status on Tuesday, 30-Sep-2025 05:17:12 JST
lainy
@phnt i TOLD him not to do the stupid object removal task
Johnny Peligro likes this.
Phantasm (phnt@fluffytail.org)'s status on Tuesday, 30-Sep-2025 05:17:28 JST
Phantasm
@i @lain Then run it, but the Pleroma DB bloat is largely a meme that comes from a lack of any maintenance. Nyanide was around 80GB after 2 years. My test instance was 4GB after a year of consuming posts from my follows with zero relays. My instance is ~30GB after 2.5 years of being subscribed to the relays of the largest Pleroma instances. Ryona Agency is, I think, also around the 80GB mark after ~3 years with mostly only bot spam deleted. My instances and Ryona reject deletes.
I ran the task on my test instance and it roughly halved the size, to around 2.2GB. It works.
It is also manageable. I pay 9.90 USD for this 120GB garbage IO box and most of it is for my Git mirrors.
Johnny Peligro likes this.
:blank: (i@declin.eu)'s status on Tuesday, 30-Sep-2025 05:17:30 JST
:blank:
@lain @phnt what's the alternative? can't keep buying a bigger slab forever
lainy (lain@lain.com)'s status on Tuesday, 30-Sep-2025 05:17:31 JST
lainy
@phnt this is the defrag of pleroma maintenance
Johnny Peligro (mischievoustomato@tsundere.love)'s status on Tuesday, 30-Sep-2025 05:17:49 JST
Johnny Peligro
@i @phnt @lain delete the whole thing, start again fresh on another subdomain
Phantasm (phnt@fluffytail.org)'s status on Tuesday, 30-Sep-2025 05:20:22 JST
Phantasm
@i The answer isn't "have you tried being less of a poor?", since I've literally told you to run the prune task if you need to. It solves the problem. If you want to archive the Fediverse, then that's your decision, and disk space is part of that consideration. More archived data always equals more data. Choose one or the other.
As for the prune taking too long, that's mostly an issue of shitty IO on VPSes. The prune on cawfee club took like 3 weeks and did not finish on the BuyVM crap 200 IOPS limited slab, and it finished in something like 2 days on grips's laptop. Same with cum salon: pernia ran it on the VPS, which was hosted I think on Oracle, and I assume that IO is also shit there. My IO is also shit on here; a repack takes like 5 hours on this ~30GB DB, because it is limited to 35MB/s.
The prune on the 4GB DB of pl.borked.technology on OVH's "secondary" disk limited to ~5MB/s took 8 hours to finish without a repack.
>not to mention needing even more space to fit a repack in the first place
pg_dump into a compressed file, dropdb pleroma, pg_restore.
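A minimal sketch of that dump / drop / restore cycle, written as a Python helper that only builds the command lines and does not run them. The dump file name `pleroma.dump` and the explicit `createdb` step are assumptions that are not in the post. Stopping Pleroma first, running as a suitably privileged user, and restoring ownership and extensions are not covered here.

```python
import shlex


def dump_restore_commands(db="pleroma", dump_file="pleroma.dump"):
    """Return the commands for a dump / drop / restore cycle, in order.

    Nothing is executed here; the caller decides how and when to run them.
    """
    return [
        # Custom format (-Fc) is compressed by default.
        ["pg_dump", "-Fc", "-d", db, "-f", dump_file],
        ["dropdb", db],
        # Assumption: recreate an empty database to restore into.
        ["createdb", db],
        ["pg_restore", "-d", db, dump_file],
    ]


if __name__ == "__main__":
    for cmd in dump_restore_commands():
        print(shlex.join(cmd))
```

Because the dump is written to a compressed file before the database is dropped, this should need less free space than a repack, at the cost of downtime for the whole cycle.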
Phantasm (phnt@fluffytail.org)'s status on Tuesday, 30-Sep-2025 05:20:22 JST
Phantasm
@i Also since you are subscribed to like 100+ relays, it might be time to run delete from activities where data->>'type' = 'Announce' and split_part(data->>'actor', '/', 4) = 'relay'. I think that's the right query.
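To see which activities that WHERE clause would select before running it, here is a small Python sketch that mimics PostgreSQL's `split_part` on a few made-up actor URLs. The sample URLs and the helper names are hypothetical. The point is that field 4 of an actor URL split on `/` is the first path segment, so only actors living at `/relay` are matched.

```python
def split_part(text, delimiter, n):
    """Mimic PostgreSQL split_part for positive n: 1-based field, '' if out of range."""
    parts = text.split(delimiter)
    return parts[n - 1] if 1 <= n <= len(parts) else ""


def matches_relay_filter(activity):
    """True if the activity would satisfy the WHERE clause quoted above."""
    return (
        activity.get("type") == "Announce"
        and split_part(activity.get("actor", ""), "/", 4) == "relay"
    )


# Hypothetical sample data, shaped like the JSON in activities.data.
samples = [
    {"type": "Announce", "actor": "https://relay.example/relay"},
    {"type": "Announce", "actor": "https://social.example/users/alice"},
    {"type": "Create", "actor": "https://relay.example/relay"},
    {"type": "Announce", "actor": "https://relay.example/actor"},
]

for activity in samples:
    print(activity["actor"], activity["type"], matches_relay_filter(activity))
```

Only the first sample matches. Ordinary user boosts are left alone, and a relay whose actor lives at a different path (the last sample) would not be caught by this filter.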
prettygood likes this.
:blank: (i@declin.eu)'s status on Tuesday, 30-Sep-2025 05:20:25 JST
:blank:
@phnt @lain the answer shouldn't be "have you tried being less of a poor?", cum.salon couldn't even finish the default db prune in two weeks, not to mention needing even more space to fit a repack in the first place
at least a reinstall doesn't brick the domain forever anymore
prettygood (prettygood@socially.drinkingatmy.computer)'s status on Tuesday, 30-Sep-2025 05:48:34 JST
prettygood
@phnt @i reminds me I need to share my maintenance script. I've been pruning old posts and clearing the relay activities weekly (yeah I know it's turbo aggressive) during a scheduled downtime and my disk usage is very modest.
prettygood (prettygood@socially.drinkingatmy.computer)'s status on Tuesday, 30-Sep-2025 06:01:13 JST
prettygood
@phnt @i hmm. I dunno if Linode is that shitty. I should look at some storage latency stats or something. Hell I could get the storage and just set up a replica writing to it and compare, that's valid.
Phantasm (phnt@fluffytail.org)'s status on Tuesday, 30-Sep-2025 06:01:14 JST
Phantasm
@prettygood @i The bolt-on storage usually has shittier performance than the main storage anyway. You would have to do a complete reinstall and set up lvmcache for it to probably work reasonably.
Like running Pleroma on the BuyVM slab is basically impossible after a few months, because it is that limited. Just loading FE would probably kick it over for a few minutes. Same with search.
prettygood (prettygood@socially.drinkingatmy.computer)'s status on Tuesday, 30-Sep-2025 06:01:15 JST
prettygood
@phnt @i my instance runs on a very heavily taxed VPS and I'm too cheap to buy bolt-on storage just to move the postgres DB onto it. I interact with things I want to keep around.
Phantasm (phnt@fluffytail.org)'s status on Tuesday, 30-Sep-2025 06:01:16 JST
Phantasm
@prettygood @i If I was to delete objects of the image bots I follow that have zero interactions, I would also probably cut down the DB size by a third. I don't run the prune task mostly because I like to search for posts from a year ago that are relevant $today in some way. And I can handle the data.
LVM-VDO with lz4 compression would also help with disk space usage.
(mint@ryona.agency)'s status on Tuesday, 30-Sep-2025 20:35:46 JST
@iamtakingiteasy @i @w0rm @phnt @prettygood How does that impact post editing or any other activities resulting in object change if said object is on the archived partition?
prettygood likes this.
Yukkuri (iamtakingiteasy@eientei.org)'s status on Tuesday, 30-Sep-2025 20:35:51 JST
Yukkuri
@phnt @i @w0rm @prettygood Here; also pushed a commit inverting known activity types to the exclude_type filter, so an index would be used during post deletes.
https://eientei.org/objects/fafbe44b-51a9-469e-a9cc-95a4d877693c
Phantasm (phnt@fluffytail.org)'s status on Tuesday, 30-Sep-2025 20:35:55 JST
Phantasm
@w0rm @i @prettygood You can also do partitioned tables with Pleroma where old posts live on slow storage and new posts live on fast storage. It requires some Pleroma patches though. @iamtakingiteasy did just that semi-recently.
di0nysius the patomskyite (w0rm@dsmc.space)'s status on Tuesday, 30-Sep-2025 20:35:57 JST
di0nysius the patomskyite
@phnt @i @prettygood
A) Post ephemerality
B) VPS wg backhaul because storage is actually cheap.