Store refactor #1010

gabriel-barrett · 2024-01-02T17:05:22Z

This PR adds a raw pointer type, which is the payload for the pointer type (that carries a tag as well). We can understand a raw pointer as a delayed hash, which is ultimately a field element once you hydrate.

The store is also changed so that it uses raw pointers instead of pointers. This is because we can view tags as field elements, which are a particular kind of raw pointer. This gives more flexibility to the store since it can now represent hashes where the first element is not necessarily a tag, but a general field element (or raw pointer). This will be used, for example, for the environment optimization, which will replace environment conses with a special operation which is a hash of 3 raw values and 1 tag. I've previously done this on this branch but it was a hacky solution on the old store. The new store will also give LEM the ability to express pure field elements, with delayed hashing, without needing to create dummy tag variables.
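To make the relationship between tagged and raw pointers concrete, here is a minimal sketch of the idea. It is not the code from this PR: the `Store` layout, the `u64` stand-in for field elements, the `Hash4` variant, and all method names are assumptions made for illustration only.

```rust
// Illustrative sketch only; names and layout are assumptions, not the PR's definitions.

/// A raw pointer: an index to either an immediate value or a tuple of four
/// raw children whose hash has not been computed yet (a "delayed hash").
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
pub enum RawPtr {
    Atom(usize),
    Hash4(usize),
}

#[derive(Clone, Copy, Debug, PartialEq, Eq)]
pub enum Tag {
    Num,
    Cons,
}

/// A tagged pointer: a tag plus a raw pointer as payload.
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
pub struct Ptr {
    pub tag: Tag,
    pub raw: RawPtr,
}

#[derive(Default)]
pub struct Store {
    pub atoms: Vec<u64>,         // `u64` stands in for a field element
    pub hash4: Vec<[RawPtr; 4]>, // children are stored as raw pointers only
}

/// Returns the index of `item` in `table`, appending it if it is not present.
fn intern<T: PartialEq>(table: &mut Vec<T>, item: T) -> usize {
    match table.iter().position(|x| *x == item) {
        Some(i) => i,
        None => {
            table.push(item);
            table.len() - 1
        }
    }
}

impl Store {
    pub fn intern_atom(&mut self, x: u64) -> RawPtr {
        RawPtr::Atom(intern(&mut self.atoms, x))
    }

    /// A tag is viewed as a field element, so it becomes an ordinary atom.
    pub fn intern_tag(&mut self, tag: Tag) -> RawPtr {
        self.intern_atom(tag as u64)
    }

    /// A cons is a 4-ary node over raw pointers: (car tag, car, cdr tag, cdr).
    pub fn intern_cons(&mut self, car: Ptr, cdr: Ptr) -> Ptr {
        let children = [
            self.intern_tag(car.tag),
            car.raw,
            self.intern_tag(cdr.tag),
            cdr.raw,
        ];
        Ptr { tag: Tag::Cons, raw: RawPtr::Hash4(intern(&mut self.hash4, children)) }
    }
}
```

Because the children are plain raw pointers, nothing in this layout forces the first slot to be a tag, which is the flexibility the description refers to.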

I tried making the API as similar as possible, but there were some changes here and there. I've also tried to make it more generic, to make it easier to add new hashes. There are some functions that use const generics which could use a little polishing; I've written some notes in the comments.

gabriel-barrett · 2024-01-02T17:45:35Z

!gpu-benchmark

arthurpaulino · 2024-01-02T19:07:53Z

!gpu-benchmark

arthurpaulino

It would be nice to have the output of `cargo criterion --bench end2end` on this branch relative to the numbers from main, so we have more data about the consequences of these changes.

src/lem/pointers.rs

src/lem/store.rs

samuelburnham · 2024-01-02T20:06:26Z

!gpu-benchmark

github-actions · 2024-01-02T22:28:14Z

Benchmark for `031d3a6`

| Test | Base | PR | % |
|---|---|---|---|
| LEM Fibonacci Prove - rc = 100/fib/num-100 | 2.4±0.00s | 2.4±0.00s | 0.00% |
| LEM Fibonacci Prove - rc = 100/fib/num-200 | 4.6±0.01s | 4.6±0.01s | 0.00% |
| LEM Fibonacci Prove - rc = 600/fib/num-100 | 1985.7±5.97ms | 1987.3±4.85ms | +0.08% |
| LEM Fibonacci Prove - rc = 600/fib/num-200 | 4.5±0.01s | 4.5±0.01s | 0.00% |

arthurpaulino

Sorry, a few more comments on docstrings, and then I think it will be ready to go.

src/lem/store.rs

arthurpaulino · 2024-01-03T00:00:36Z

Wonderful. Thank you very much!

One last thing: please run `cargo criterion --bench end2end` on this branch after running it on main, so we have more fine-grained perf data.

As a suggestion, the performance diff can be part of the PR description so it's easier to track such perf changes (or absence of changes).

huitseeker

> This PR adds a raw pointer type, which is the payload for the pointer type (that carries a tag as well). We can understand a raw pointer as a delayed hash, which is ultimately a field element once you hydrate.
>
> The store is also changed so that it uses raw pointers instead of pointers. This is because we can view tags as field elements, which are a particular kind of raw pointer. This gives more flexibility to the store since it can now represent hashes where the first element is not necessarily a tag, but a general field element (or raw pointer). This will be used, for example, for the environment optimization, which will replace environment conses with a special operation which is a hash of 3 raw values and 1 tag. [...] The [...] store will also give LEM the ability to express pure field elements, with delayed hashing, without needing to create dummy tag variables.

The above is excellent documentation, because it pulls together in one place several pieces (that are documented in several places) and ties them up at a high level. I'd consider leaving an edited version of this in a module comment (e.g. `lem::store`?).

This is trying to use const generics (which are currently sorely limited, see e.g.) to link our hash arity to the hashing API of neptune, which morally is using generic_array to represent that same arity. Performing that link may not be necessary today, but FWIW it is a solved problem (as much as is possible in today's compiler) in https://github.com/RustCrypto/utils/tree/master/hybrid-array

huitseeker · 2024-01-03T00:17:12Z

src/lem/store.rs

+        &self,
+        raw_ptrs: &[RawPtr; N],
+    ) -> Option<[Ptr; P]> {
+        assert_eq!(P * 2, N);


Yeah, generic_const_exprs is sorely missing here.

Thankfully, you're using this for array operations, and we already have generic-array in our dependencies. We're therefore in the special case where we can use this obscure but beautiful crate, which I believe is the only way to use const generic array operations with a path to const generic migration when the Rust compiler supports it:
https://docs.rs/hybrid-array/latest/hybrid_array/
https://github.com/RustCrypto/utils/tree/master/hybrid-array
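For readers unfamiliar with the limitation being discussed, the following sketch shows the general pattern on stable Rust: two independent const parameters whose relationship (`N == 2 * P`) cannot be stated in the signature and is therefore checked at runtime. The function name `pair_up` and the use of `u64` values are assumptions for illustration; this is neither the PR's function nor the hybrid-array approach.

```rust
/// Groups a flat array of `N` values into `P` consecutive pairs.
///
/// Without `generic_const_exprs`, the return type cannot be written as
/// `[(u64, u64); N / 2]`, so `P` is a separate parameter and the
/// relationship is asserted when the function runs.
pub fn pair_up<const N: usize, const P: usize>(flat: &[u64; N]) -> [(u64, u64); P] {
    assert_eq!(P * 2, N, "expected N == 2 * P");
    // Build each output pair from two adjacent input elements.
    std::array::from_fn(|i| (flat[2 * i], flat[2 * i + 1]))
}
```

A caller picks `P` through the expected type, for example `let pairs: [(u64, u64); 2] = pair_up(&[1, 2, 3, 4]);`. A mismatched `P` compiles but panics at the assertion, which is the cost of not being able to express the constraint in the type.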

arthurpaulino

These are the sensible numbers from my machine:

end2end_benchmark/end2end_go_base_nova/_10_0
                        time:   [626.69 ms 627.41 ms 628.10 ms]
                        change: [-0.5097% -0.3768% -0.2495%] (p = 0.00 < 0.05)
                        Change within noise threshold.

store_benchmark/store_go_base_pallas/_10_16
                        time:   [157.07 µs 157.15 µs 157.22 µs]
                        change: [+3.0513% +3.0999% +3.1489%] (p = 0.00 < 0.05)
                        Performance has regressed.
store_benchmark/store_go_base_pallas/_10_160
                        time:   [155.36 µs 155.44 µs 155.51 µs]
                        change: [-0.1862% -0.0101% +0.1333%] (p = 0.91 > 0.05)
                        No change in performance detected.

hydration_benchmark/hydration_go_base_pallas/_10_16
                        time:   [884.80 ns 884.86 ns 884.91 ns]
                        change: [+0.3267% +0.3518% +0.3757%] (p = 0.00 < 0.05)
                        Change within noise threshold.
hydration_benchmark/hydration_go_base_pallas/_10_160
                        time:   [884.53 ns 884.62 ns 884.72 ns]
                        change: [+0.3261% +0.3548% +0.3833%] (p = 0.00 < 0.05)
                        Change within noise threshold.

eval_benchmark/eval_go_base_pallas/_10_16
                        time:   [6.7053 ms 6.7144 ms 6.7287 ms]
                        change: [+1.0284% +1.1951% +1.3932%] (p = 0.00 < 0.05)
                        Performance has regressed.
eval_benchmark/eval_go_base_pallas/_10_160
                        time:   [66.937 ms 67.014 ms 67.109 ms]
                        change: [-1.1014% -0.9183% -0.7256%] (p = 0.00 < 0.05)
                        Change within noise threshold.

They look fine to me. Great work!

`pos` and `index` (from the last commit) are pretty much screaming for a roundtrip test. After that I will stamp this PR.

arthurpaulino

🏬 🚚 🏪

huitseeker · 2024-01-04T21:43:24Z

src/lem/pointers.rs

+        match self {
+            RawPtr::Atom(x) => Some(*x),
+            _ => None,
+        }


With match_opt! you can turn this into

match_opt!(self, RawPtr::Atom(x) => *x)
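As a rough illustration of what such a macro does, here is one way it could be written with `macro_rules!`. This definition is an assumption for illustration, not necessarily the one added in this PR's "Utility macros" commit, and the method name `get_atom` and the `Hash4` variant are hypothetical.

```rust
/// Hypothetical definition: expands to a `match` that yields `Some(value)`
/// when the pattern matches and `None` otherwise.
macro_rules! match_opt {
    ($scrutinee:expr, $pattern:pat => $value:expr) => {
        match $scrutinee {
            $pattern => Some($value),
            _ => None,
        }
    };
}

#[derive(Debug)]
pub enum RawPtr {
    Atom(usize),
    Hash4(usize),
}

impl RawPtr {
    /// Returns the atom index if this raw pointer is an atom.
    pub fn get_atom(&self) -> Option<usize> {
        match_opt!(self, RawPtr::Atom(x) => *x)
    }
}
```

With this definition, `RawPtr::Atom(3).get_atom()` evaluates to `Some(3)` and `RawPtr::Hash4(0).get_atom()` to `None`.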

huitseeker · 2024-01-04T21:49:05Z

src/lem/tag.rs

+        }
+    }
+
+    pub fn pos(i: usize) -> Option<Self> {


We already have a tag <-> usize conversion for each of the basic enums you're papering over. Have you thought about defining this conversion in terms of those individual conversions plus a set of offsets (Expr_offset = 0, Cont_offset = 12, Op1_offset, Op2_offset, etc.)?

Another approach would be a proc_macro that walks the definition of the Tag enum.

My problem is that with this code, the maintenance is going to be a pain: any mis-ordering of fields will be a bug.
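A small sketch of the offset-based approach suggested above, together with the kind of roundtrip check requested earlier in the review. The enums here are deliberately tiny stand-ins (the variants, the `ALL` tables, and the constant names are all assumptions), so the offsets differ from the real ones.

```rust
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
pub enum ExprTag { Nil, Cons, Num }

#[derive(Clone, Copy, Debug, PartialEq, Eq)]
pub enum ContTag { Outermost, Call }

impl ExprTag {
    const ALL: [ExprTag; 3] = [ExprTag::Nil, ExprTag::Cons, ExprTag::Num];
    fn index(self) -> usize { self as usize }
    fn pos(i: usize) -> Option<Self> { Self::ALL.get(i).copied() }
}

impl ContTag {
    const ALL: [ContTag; 2] = [ContTag::Outermost, ContTag::Call];
    fn index(self) -> usize { self as usize }
    fn pos(i: usize) -> Option<Self> { Self::ALL.get(i).copied() }
}

#[derive(Clone, Copy, Debug, PartialEq, Eq)]
pub enum Tag { Expr(ExprTag), Cont(ContTag) }

// Each block of indices starts where the previous enum's block ends,
// so adding a variant shifts later offsets automatically.
pub const EXPR_OFFSET: usize = 0;
pub const CONT_OFFSET: usize = EXPR_OFFSET + ExprTag::ALL.len();
pub const TAG_COUNT: usize = CONT_OFFSET + ContTag::ALL.len();

impl Tag {
    /// Maps a tag to its position in the combined ordering.
    pub fn index(self) -> usize {
        match self {
            Tag::Expr(t) => EXPR_OFFSET + t.index(),
            Tag::Cont(t) => CONT_OFFSET + t.index(),
        }
    }

    /// Inverse of `index`; returns `None` for out-of-range positions.
    pub fn pos(i: usize) -> Option<Self> {
        if i < CONT_OFFSET {
            ExprTag::pos(i - EXPR_OFFSET).map(Tag::Expr)
        } else {
            ContTag::pos(i - CONT_OFFSET).map(Tag::Cont)
        }
    }
}
```

A roundtrip test then only needs to loop over `0..TAG_COUNT` and check that `Tag::pos(i).unwrap().index() == i`. Ordering mistakes can still occur inside each per-enum `ALL` table, but they stay local to one enum and are caught by that loop.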

gabriel-barrett requested review from a team as code owners January 2, 2024 17:05

arthurpaulino reviewed Jan 2, 2024

gabriel-barrett force-pushed the store-refactor branch from d128fdc to cc602b2 Compare January 2, 2024 22:16

arthurpaulino reviewed Jan 2, 2024

gabriel-barrett force-pushed the store-refactor branch 2 times, most recently from 3e4479a to 70ee021 Compare January 2, 2024 23:54

huitseeker reviewed Jan 3, 2024

gabriel-barrett added 3 commits January 3, 2024 14:57

WIP raw pointers and store

deba627

const generic store functions

cd31d0a

Finished raw store

237fd72

gabriel-barrett force-pushed the store-refactor branch from cc5d9b5 to bfc78a0 Compare January 3, 2024 17:57

gabriel-barrett added 6 commits January 3, 2024 22:05

Integrated raw_store into store

963c836

Store simplification

fe8e602

Renames, comments, tests, simplifications

7db1e0d

Updated some documentation

f72246a

Utility macros

87d2756

Reverted Hash3 and comms change

c25d5fb

gabriel-barrett force-pushed the store-refactor branch from aad6df6 to c25d5fb Compare January 4, 2024 01:06

Tag interning optimization

c13c502

arthurpaulino reviewed Jan 4, 2024

LEM tag own file and tests

337ef0a

gabriel-barrett force-pushed the store-refactor branch from 4b40af9 to 337ef0a Compare January 4, 2024 21:16

arthurpaulino previously approved these changes Jan 4, 2024

huitseeker reviewed Jan 4, 2024

gabriel-barrett dismissed arthurpaulino’s stale review via def5463 January 5, 2024 17:13

Better conversion functions

250b2cc

gabriel-barrett force-pushed the store-refactor branch from def5463 to 250b2cc Compare January 5, 2024 17:49

arthurpaulino approved these changes Jan 5, 2024

gabriel-barrett enabled auto-merge January 5, 2024 18:11

gabriel-barrett added this pull request to the merge queue Jan 5, 2024

github-actions bot pushed a commit that referenced this pull request Jan 5, 2024

[automated] GPU Benchmark from PR #1010

b595429

Merged via the queue into argumentcomputer:main with commit 993e958 Jan 5, 2024
12 checks passed

gabriel-barrett deleted the store-refactor branch January 5, 2024 18:40
