Implement persistent commitments #543

arthurpaulino · 2023-07-18T22:48:35Z

Persist commitments
Load persisted commitments
Content-address proofs to generate stable IDs
Don't prove cached proofs
Revamp FieldData (credits to @huitseeker)

Closes #529

huitseeker

This is a nice improvement on the existing code, but I would urge you to consider whether a field label is everything you need, or whether you want to start thinking about some data you can deserialize to a Lang<F, Coproc<F>>.

huitseeker · 2023-07-18T23:25:55Z

src/cli/commitment.rs

+
+/// Holds data for commitments.
+///
+/// WARNING: CONTAINS PRIVATE DATA


Does that warning differ qualitatively from any other instance of a store?

Yes, this one contains secrets

That I would expect. The idea that other stores don't is more surprising.

In practical terms, consider that the code base you're editing here will be reviewed by a professional cryptography auditor. How does the comment you're offering here help them contextualize how files for Commitment should be treated, as opposed to other store files?

Will address this tomorrow

huitseeker · 2023-07-19T01:12:33Z

src/cli/field_data.rs

+}
+
+#[allow(dead_code)]
+impl FieldData {


Ah, but with this version you are moving the conversion to/from bytes at the wrapping time. I suspect you could peg what you want on a

struct Labeled<T: Serialize + DeserializeOwned> { label: LanguageField, val: T }

For this case I do intend to deserialize in two steps. We first read from FS to know the field and then we read from the vector to get the data with the correct field elements

Yes. You can definitely have those two steps, but in sequence within the same function, with the above structure.

I don't get the idea. I don't want to deserialize the vector of bytes if there is some inconsistency in the field. I want to error earlier. In other words, the vector is desirable

For the record, I did try something like you propose in my first attempt but then I got stuck because Rust doesn't have dependent types. I wanted T<F> where F: LurkField
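A minimal sketch of the two-step read discussed above, done in sequence within one function so that a field mismatch is reported before the payload bytes are decoded. It uses only the standard library: the one-byte tag format, `FieldLabel`, `wrap`, and `unwrap_with` are hypothetical names for illustration and stand in for the serde-based `Labeled<T>` and `LanguageField` mentioned in the thread.

```rust
/// Hypothetical stand-in for Lurk's `LanguageField`. Tags are assigned by
/// hand so that reordering the variants cannot change the stored format.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum FieldLabel {
    Pallas,
    Vesta,
}

impl FieldLabel {
    fn to_tag(self) -> u8 {
        match self {
            FieldLabel::Pallas => 0,
            FieldLabel::Vesta => 1,
        }
    }

    fn from_tag(tag: u8) -> Option<Self> {
        match tag {
            0 => Some(FieldLabel::Pallas),
            1 => Some(FieldLabel::Vesta),
            _ => None,
        }
    }
}

#[derive(Debug, PartialEq, Eq)]
pub enum DecodeError {
    Empty,
    UnknownTag(u8),
    FieldMismatch { expected: FieldLabel, found: FieldLabel },
    BadPayload,
}

/// Prefix an already-serialized payload with its field tag.
pub fn wrap(label: FieldLabel, payload: &[u8]) -> Vec<u8> {
    let mut out = Vec::with_capacity(payload.len() + 1);
    out.push(label.to_tag());
    out.extend_from_slice(payload);
    out
}

/// Step 1: read and check the label. Step 2: only if the field matches,
/// hand the remaining bytes to `decode`. On a mismatch `decode` never runs.
pub fn unwrap_with<T>(
    bytes: &[u8],
    expected: FieldLabel,
    decode: impl FnOnce(&[u8]) -> Option<T>,
) -> Result<T, DecodeError> {
    let (&tag, payload) = bytes.split_first().ok_or(DecodeError::Empty)?;
    let found = FieldLabel::from_tag(tag).ok_or(DecodeError::UnknownTag(tag))?;
    if found != expected {
        return Err(DecodeError::FieldMismatch { expected, found });
    }
    decode(payload).ok_or(DecodeError::BadPayload)
}
```

In this sketch the caller supplies the field-specific decoder, which sidesteps the need for a `T<F>` type: the label check and the typed decode stay separate, yet both happen in one call.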

Hmmm.

As a partially unrelated topic, I don't think any strategy that persists important Lurk data using an ad-hoc/bincode serialization of LanguageField is a good idea. It would be A Bad Thing if changes to that enum (which are so likely they are predictable, so predictable we should plan for them) led to supposedly durable data becoming unreadable.

How can we encode that information, then? Should we create another enum whose stability we try to guarantee?

If you want to have a completely general, 'dynamic' serialization, then it's going to require more design.

But is that really what is needed here? I think you just need to know what LanguageField you are currently using. Then everything will be written and read using that LanguageField. Moreover, it follows from our general cryptographic assumptions that any value used as a commitment (or as the hash part of a Lurk expression) cannot be produced by hashing some preimage in another LanguageField.

Therefore, as long as you are looking values up by their hash/digest, then it's fine to completely segregate them by field. So, given that this work is still using a filesystem-based commitment store, you could have a directory structure like:

.lurk/commitments/pallas
.lurk/commitments/vesta
.lurk/commitments/bls12-381
.lurk/commitments/grumpkin
…

If the current LanguageField is pallas, then you can write all commitments to the .lurk/commitments/pallas directory and look all commitments up there too. There is never any possibility that a valid commitment (say you are looking it up) expressed as an element of pallas::Scalar will be stored elsewhere.
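A minimal sketch of that per-field layout, assuming a `.lurk`-style root directory and a hex-encoded digest as the file name. The `LanguageField` enum below is a hypothetical stand-in that lists only the fields named in the example, and `dir_name` and `commitment_path` are illustrative names, not functions from the Lurk code base.

```rust
use std::path::{Path, PathBuf};

/// Hypothetical stand-in for Lurk's `LanguageField`.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum LanguageField {
    Pallas,
    Vesta,
    Bls12_381,
    Grumpkin,
}

impl LanguageField {
    /// Directory name for this field. These strings are spelled out by hand
    /// so the on-disk layout does not depend on how the enum is serialized.
    pub fn dir_name(self) -> &'static str {
        match self {
            LanguageField::Pallas => "pallas",
            LanguageField::Vesta => "vesta",
            LanguageField::Bls12_381 => "bls12-381",
            LanguageField::Grumpkin => "grumpkin",
        }
    }
}

/// Build `<root>/commitments/<field>/<digest_hex>`. Both writing and lookup
/// go through this one function, so a commitment made in one field is never
/// opened while the session is using another.
pub fn commitment_path(root: &Path, field: LanguageField, digest_hex: &str) -> PathBuf {
    root.join("commitments").join(field.dir_name()).join(digest_hex)
}
```

With the current field set to `LanguageField::Pallas`, `commitment_path(Path::new(".lurk"), LanguageField::Pallas, digest)` resolves inside `.lurk/commitments/pallas`, and no field label needs to be stored in the file itself.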

Also, since (as above), we also cannot have collisions between fields (assuming indexes are always hashes, and the chosen field/hash combinations still preserve our security assumptions) you could even get the 'dynamic' behavior by searching for a given commitment in all available field directories. To prevent having to search in multiple places, you could use symlinks (for example) to provide a single index (perhaps hierarchically structured to avoid too-large directories, etc.)
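The "search every field directory" variant could look roughly like the sketch below. It assumes one subdirectory per field under a commitments root and relies on the no-collision assumption stated above, so the first hit is taken as the only hit. `FIELD_DIRS` and `find_commitment` are hypothetical names, and the symlink index is not shown.

```rust
use std::path::{Path, PathBuf};

/// Field directories to probe, in a fixed order (illustrative list only).
const FIELD_DIRS: [&str; 4] = ["pallas", "vesta", "bls12-381", "grumpkin"];

/// Look for `<commitments_root>/<field>/<digest_hex>` in every field
/// directory and return the field name together with the path of the first
/// regular file found, or `None` if no directory contains the digest.
pub fn find_commitment(
    commitments_root: &Path,
    digest_hex: &str,
) -> Option<(&'static str, PathBuf)> {
    FIELD_DIRS.iter().find_map(|&field| {
        let candidate = commitments_root.join(field).join(digest_hex);
        if candidate.is_file() {
            Some((field, candidate))
        } else {
            None
        }
    })
}
```

The returned field name is the metadata that would otherwise have to be serialized alongside the commitment: it is recovered from where the file lives rather than from its contents.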

Obviously, with a more powerful database management system than 'the file system', a different approach could be taken. But I think the above (especially the simplest version, which is probably all we need initially) should be fine.

The point is: I think you may be trying to solve the wrong problem. Certainly if the goal is a quick PR that decrees a format through code then that is the case.

While we may want to eventually have a format that allows dynamically mixing field types, that will need to be worked out as a careful extension of z_data.

That works for commitments, but lurk verify <proof_id> would need to ask the user for the field, which is annoying. So I thought that we might as well just use the same infra that's already available to give us extra consistency, assuring that we won't open a commitment that was generated in a field while we're in another field.

Aren't proofs also always with respect to a known prover/verifier, which is itself necessarily parameterized on curve+field?

Also, why the heck are proofs (apparently) being stored with an id that is the timestamp rather than using content-addressing as previously?

Question: Are you trying to support verifying any Lurk proof, or do we assume that a given REPL session will only verify proofs of statements in the current field?

Either way, if you content-address the proofs, you should be able to use the symlink approach as above. (NOTE: in that model, you will need to check the actual location of resolved symlinks to get that meta-data — but I still think that's not what should be needed here.)

Opinion: Proof IDs should be the content address of the claim — just as they are in fcomm and clutch. This is actually important because it allows for caching of proofs. It's easy to imagine applications for which equivalent proofs are requested more than once (even many, many times). For example, that's how the current Lurk website works (or was designed to): we serve real proofs of expected claims in a way that is cost effective but still accurate.

A real outsourced-but-provable computation service could do the same.
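To illustrate why content-addressed proof IDs enable caching, here is a small sketch in which the ID is derived from the claim bytes and the prover runs only when no proof is stored under that ID. Everything here is an assumption for illustration: `content_id` uses FNV-1a, a non-cryptographic hash chosen only to keep the example within the standard library (a real system would use a cryptographic hash of the claim), and `ProofCache` is an in-memory stand-in for the on-disk store.

```rust
use std::collections::HashMap;

/// Derive a stable ID from the claim bytes using 64-bit FNV-1a.
/// NOT cryptographic: a placeholder for a real content hash.
pub fn content_id(claim: &[u8]) -> String {
    let mut hash: u64 = 0xcbf29ce484222325; // FNV-1a offset basis
    for &byte in claim {
        hash ^= u64::from(byte);
        hash = hash.wrapping_mul(0x100000001b3); // FNV-1a prime
    }
    format!("{:016x}", hash)
}

/// In-memory proof store keyed by the content ID of the claim.
pub struct ProofCache {
    proofs: HashMap<String, Vec<u8>>,
    /// Number of times the prover was actually invoked.
    pub prover_calls: usize,
}

impl ProofCache {
    pub fn new() -> Self {
        ProofCache { proofs: HashMap::new(), prover_calls: 0 }
    }

    /// Return the ID and proof for `claim`, calling `prove` only if no proof
    /// is cached under the claim's content ID.
    pub fn get_or_prove(
        &mut self,
        claim: &[u8],
        prove: impl FnOnce(&[u8]) -> Vec<u8>,
    ) -> (String, &[u8]) {
        let id = content_id(claim);
        if !self.proofs.contains_key(&id) {
            self.prover_calls += 1;
            let proof = prove(claim);
            self.proofs.insert(id.clone(), proof);
        }
        let proof = self.proofs[&id].as_slice();
        (id, proof)
    }
}
```

Because the ID depends only on the claim, a repeated request for the same claim maps to the same entry and skips proving. A timestamp-based ID would give every request a fresh key and defeat the cache.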

arthurpaulino · 2023-07-19T13:34:31Z

> This is a nice improvement on the existing code, but I would urge you to consider whether a field label is everything you need, or whether you want to start thinking about some data you can deserialize to a Lang<F, Coproc<F>>.

Yeah, it will be easier to see once we generalize Lurk proofs across more backends (and fields). For now, I think it is enough.

huitseeker

Going back to the core point: https://gist.github.com/huitseeker/2c6bc8bbecba59cdd4e8c6cbfadebc5c
Note this is independent of what you actually use for a type label, so you could extend this solution to carry field, group, versioning information (...) to your satisfaction.

huitseeker

Thanks a lot! This looks 💥!

arthurpaulino added 4 commits July 18, 2023 17:53

meta commit and hide

09af891

progress on fetch

076b70d

resolve lifetime issue

2fce9ed

finish fetch

68d26bd

arthurpaulino requested review from huitseeker and porcuquine July 18, 2023 22:48

arthurpaulino requested a review from a team as a code owner July 18, 2023 22:48

huitseeker reviewed Jul 19, 2023

arthurpaulino added 4 commits July 18, 2023 23:19

improve Commitment implementations; open meta command that prints data

4d4b44d

add pub(crate)

4cd3914

enclose non-wasm code in particular modules

c1f3fb8

improved remarks on private data

af42e1b

arthurpaulino requested a review from huitseeker July 19, 2023 13:32

arthurpaulino added 5 commits July 19, 2023 11:03

better generalizations in lurk_proof

5fe6b6a

safer extraction of FieldData

d87c21d

cleaner user feedback on fetch

a4cb3e4

remove unnecessary allow(dead_code) flag

b79a72d

clean up paths.rs

55be212

huitseeker reviewed Jul 19, 2023

src/cli/field_data.rs (outdated)

review suggestions + revamp

d66321e

arthurpaulino requested a review from huitseeker July 19, 2023 23:55

porcuquine reviewed Jul 20, 2023

src/cli/repl.rs (outdated)

huitseeker reviewed Jul 20, 2023

src/cli/field_data.rs (outdated)

src/cli/field_data.rs

arthurpaulino added 5 commits July 19, 2023 23:11

encoding proof claim as Lurk data

b0a0fde

clean up

e1aa3cc

document field_data

52b4d2b

add tests for field_data

1d9deae

change constructor names on Enum2 for a better POC test

e3c7ca9

arthurpaulino requested a review from huitseeker July 20, 2023 10:52

arthurpaulino requested a review from porcuquine July 20, 2023 10:52

huitseeker previously approved these changes Jul 20, 2023

fix docstring for LurkProofMeta

46bf080

arthurpaulino dismissed huitseeker’s stale review via 46bf080 July 20, 2023 14:22

huitseeker enabled auto-merge July 20, 2023 14:45

huitseeker approved these changes Jul 20, 2023

huitseeker added this pull request to the merge queue Jul 20, 2023

Merged via the queue into master with commit 4a92ae0 Jul 20, 2023
2 checks passed

huitseeker deleted the ap/meta-commits branch July 20, 2023 15:14
