Tracking data changes vs schema changes

tempire · 4 October 2024 20:55

I would like to track changes to records based on whether there’s a history entry.

If there’s a schema change, however, it still records a history entry, as you would expect.

I’m looking for advice on how others have handled this particular issue - whether by building abstractions or by convention.

This other topic touches on it, however I’m not looking for future use, I need to come up with a solution in the short term.

My current thought is to store a key for the type of data change, and query that key as a matter of convention going forward. As such, I have a question about the internal performance of querying - does it retrieve the entire document if I’m only querying one key?

refset · 7 October 2024 12:20

Off the top of my head there are a few options (in no particular order):

Version your tables, e.g. such that you have products1 and later products2 when the schema needs to change - this approach would work better once XT2 supports VIEWs (which can help with avoiding pushing too much of that complexity into downstream queries)
Use new IDs and capture the schema version as an additional key(/column/attribute) and filter on it. However, this feels less than ideal from a modelling and performance perspective
Version all individual keys(/columns/attributes) and track the set of ‘current’ keys for the current schema separately. e.g. given product_name1, product_name2, and product_tax4, the current schema might only be product_name2 and product_tax4

No, XTDB should be able to accelerate all queries against top-level keys(/columns/attributes) - and this is true of both v1 and v2 (although the underlying mechanics are quite different!)

It might be simpler to approach this topic from the perspective of the requirements of your queries - can you say/illustrate more about what you’re needing?

Also, just in case you hadn’t seen this before: Datomic - The Ten Rules of Schema Growth

Topic		Replies	Views
Tracking schema changes bitemporally Users v2	3	79	7 October 2024
(V2) Best way to handle frequent updates that might not contain any changes? Users v2	13	337	12 September 2024
XTDBv2 available language features and schema support Users	3	125	26 September 2024
[FEATURE] "ALTER COLUMN" - like functionality Users v2 , feature	2	203	22 December 2023
Time Series vs. Bitemporal Explainer/Example Users	0	130	8 December 2024

Tracking data changes vs schema changes

Related topics