(2/2)
With such schema/query changes it is possible now to partition objects_data/activities_data tables at any granularity.
For example, for activites, on yearly basis:
And it would work transparently. Partitions can be either added manually for upcoming years, or added on-demand in insert trigger.
On eientei DB snapshot such partitioning takes 1-3 hours.
Naturally, due to attributes denormalization there will be some overhead, but a majority of old partitions could now be extracted to separate tablespace (on another disk/raid entirely, e.g. to hdd freeing up ssd space).
For example using eientei numbers, in (data|indexes)
before any schema changes:
- objects (65G|22G)
- activities (79G|67G)
after schema change and partitioning:
- objects_attributes (7G|8G)
- objects_2020 (1M|1M)
- objects_2021 (8G|2G)
- objects_2022 (15G|4G)
- objects_2023 (15G|4G)
- objects_2024 (21G|6G)
- objects_2025 (3G|1G)
- activities_attributes (47G|53G)
- activities_data_2020 (1M|1M)
- activities_data_2021 (6G|500M)
- activities_data_2022 (12G|1200M)
- activities_data_2023 (15G|1500M)
- activities_data_2024 (21G|6G)
- activities_data_2025 (4G|300M)
which is a ~15% total overhead, but ~60% of total size can be moved to hdd for cold online storage by doing:
create tablespace archive location '/mnt/coldstorage/postgres/archive'; alter table activities_data_2020 set tablespace archive; alter table activities_data_2021 set tablespace archive; alter table activities_data_2022 set tablespace archive; alter table activities_data_2023 set tablespace archive; alter table activities_data_2024 set tablespace archive; ... alter table objects_2020 set tablespace archive; alter table objects_2021 set tablespace archive; alter table objects_2022 set tablespace archive; alter table objects_2023 set tablespace archive; alter table objects_2024 set tablespace archive; ...while keeping only current year on fast ssd.