Better support for large data #88

jiangyi15 · 2023-07-29T14:55:58Z

Use tf.data.Dataset for large data in the option

data:
    lazy_call: True

codecov · 2023-07-30T02:05:27Z

Codecov Report

Merging #88 (cc8ba51) into dev (f3e7662) will increase coverage by 0.21%.
The diff coverage is 76.64%.

@@            Coverage Diff             @@
##              dev      #88      +/-   ##
==========================================
+ Coverage   73.83%   74.05%   +0.21%     
==========================================
  Files         104      104              
  Lines       14640    14768     +128     
  Branches     2717     2746      +29     
==========================================
+ Hits        10810    10937     +127     
+ Misses       3177     3163      -14     
- Partials      653      668      +15

Flag	Coverage Δ
unittests	`74.05% <76.64%> (+0.21%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

Files Changed	Coverage Δ
tf_pwa/data_trans/helicity_angle.py	`93.65% <ø> (ø)`
tf_pwa/cal_angle.py	`76.38% <20.00%> (+1.38%)`	⬆️
tf_pwa/config_loader/multi_config.py	`62.16% <33.33%> (-1.11%)`	⬇️
tf_pwa/fit.py	`22.58% <50.00%> (+0.35%)`	⬆️
tf_pwa/config_loader/plot.py	`69.10% <64.51%> (-0.23%)`	⬇️
tf_pwa/amp/core.py	`74.43% <66.66%> (-0.27%)`	⬇️
tf_pwa/config_loader/data.py	`61.66% <70.83%> (+1.12%)`	⬆️
tf_pwa/utils.py	`66.18% <77.77%> (+0.52%)`	⬆️
tf_pwa/model/model.py	`50.44% <81.25%> (+0.82%)`	⬆️
tf_pwa/data.py	`75.88% <83.11%> (+0.04%)`	⬆️
... and 4 more

... and 1 file with indirect coverage changes

jiangyi15 · 2023-07-31T05:53:36Z

Currently, the best option (26) for large dataset is

data:   
    lazy_call: True
    use_tf_function: True
    no_id_cached: True
    jit_compile: True
    cached_lazy_call: cached_data/

batch=100000
Step time (nll_grad()) for 1M data + 10M PHSP MC about 10s.

jiangyi15 added 5 commits July 23, 2023 14:48

fixed: shape with dataset.map

9e7a421

Merge remote-tracking branch 'origin/dev' into dataloader

d83577c

refactor: use dataset for lazycall

b4d378e

remove cache to reduce memory cost

2df2dd5

ci: fixed error in ci

0f27750

jiangyi15 added 6 commits July 30, 2023 18:58

feat: callback for fit

8b922d1

feat: callback for fit

957e1d0

feat: suport tf function in lazycall

5f41ecc

ci: fixed ci error

b21872f

ci: fixed ci error

7e9a7ce

feat: cached_lazy_call

cfb7270

jiangyi15 added 6 commits July 31, 2023 18:50

feat: batch in plot_partial_wave

b6cf9d4

refactor: cached_lazy_call

d88a8f5

fixed: hessian with lazycall

4724379

ci: add tests for lazycall

7f85fd7

ci: add tests for lazycall

cf44fe4

feat: only_left_angle to reduce memory cost

cc8ba51

jiangyi15 merged commit 377a000 into dev Aug 3, 2023
12 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Better support for large data #88

Better support for large data #88

jiangyi15 commented Jul 29, 2023

codecov bot commented Jul 30, 2023 •

edited

Loading

jiangyi15 commented Jul 31, 2023

Better support for large data #88

Better support for large data #88

Conversation

jiangyi15 commented Jul 29, 2023

codecov bot commented Jul 30, 2023 • edited Loading

Codecov Report

jiangyi15 commented Jul 31, 2023

codecov bot commented Jul 30, 2023 •

edited

Loading