Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

POC rework threading with PostgreSQL #1990

Draft
wants to merge 213 commits into
base: postgresql
Choose a base branch
from

Conversation

chibenwa
Copy link
Contributor

@chibenwa chibenwa commented Feb 8, 2024

Appart from the first commit that I think shall be cherry-picked on master

I wanted to experiment the idea to drop the thread lookup table and instead rely on a postgreSQL index in
order not to maintain a data projection.

Kepping the data model normalised minimises the amount of writes what seems important to me on a BTree non scalable system.

@quantranhong1999 especially: thoughts? Do you wonna experiment this?

vttranlina and others added 30 commits January 11, 2024 15:34
- artifactId: apache-james-backends-postgres
- artifactId: apache-james-mailbox-postgres
- Copy from  mailbox/jpa -> mailbox/postgres
- artifactId: james-server-postgres-app
- Copy from  apps/jpa-app -> apps/postgres-app
tested manually the binding with the subscription module -> create subscription table upon James startup successfully.
…ryptedMailboxMessage, JPAMailboxMessageWithAttachmentStorage
vttranlina and others added 27 commits January 29, 2024 09:29
…Repository interface

- It will more abstract for another implement (here is Postgres)
In the Postgres implementation, we accept to just store the input ZonedDateTime under the UTC time zone. Therefore, we need to be more flexible comparing ZonedDateTime between different time zones.
E.g.:  2014-04-02T19:01Z[UTC] isEqual 2014-04-03T02:01+07:00[Asia/Vientiane]
Wraps Reference, in-reply-to, Mime message id and subject.

Utility to hash it with murmur

Goals:
 - Reduce argument cardinality of numerous methods
 - Centralize parsing from headers (duplicated twice)
 - Centralize hashing (done for cassandra and PG backend)
 - Ease of passing that information downstream (future changes)
Reduce overall cardinality too.
Only on the reading thread side.

This is actually saving us one denormalization.
Use the message table instead. That way
we are strongly consistent.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants