Model Initialized outsize [w_min, w_max]: Pinpointing the bug in issue #604 #635

Zhaoxian-Wu · 2024-04-01T23:25:57Z

Description

As discussed in #604, the model weights will sometimes fall outsize [w_min, w_max].

Bug Pinpoint

The bug happens because of the incorrect initialization of w_min_bound_ and w_max_bound_ (see the code). It seems the following code snippet is designed to deal with the situation where the share weights is deployed and
PulsedDevice.perfect_bias is turned on. When the PulsedDevice.perfect_bias is on, the last dimension of the weights is incorrectly amplified by 100 times, yielding the incorrect active regions and weights.

// perfect bias
if ((par.perfect_bias) && (j == this->x_size_ - 1)) {
  w_scale_up_[i][j] = par.dw_min;
  w_scale_down_[i][j] = par.dw_min;
  w_min_bound_[i][j] = (T)100. * par.w_min; // essentially no bound
  w_max_bound_[i][j] = (T)100. * par.w_max; // essentially no bound
}

TODO

I was trying to fit the bug directly, but I found that I couldn't control shared_weight through the AnalogLinear initialization. I guess we should design a flag here to better control the shared weight behavior.

The text was updated successfully, but these errors were encountered:

maljoras · 2024-04-02T06:51:34Z

Thanks for raising this issue. perfect_bias is indeed some "old" parameter setting, that should only be relevant for analog_bias. Since we have digital_bias now, it should actually be deleted. It has nothing to do with shared_weights so this is not relevant here.

Zhaoxian-Wu · 2024-04-02T17:13:33Z

I see. It seems that using digital_bias instead is a more natural solution. But what does shared_weights do? Does that mean multiple tiles share the same torch array?

maljoras · 2024-04-02T19:25:53Z

Shared weights is saying that the memory to the tile is handled from torch (and not from within C++). This means that also the backward etc is handled by torch. Note that the RPUCuda library is capable of handling the memory of the tile arrays and data internally (as it is a independent library that can be also used independently of pytorch)

Zhaoxian-Wu · 2024-04-08T04:16:12Z

Shared weights is saying that the memory to the tile is handled from torch (and not from within C++). This means that also the backward etc is handled by torch. Note that the RPUCuda library is capable of handling the memory of the tile arrays and data internally (as it is a independent library that can be also used independently of pytorch)

I see. Thanks for your kind explaination :D

kaoutar55 · 2024-05-08T15:18:22Z

@maljoras do we need to remove the perfect_bias in the code flow when we are using digital bias? what do you suggest here? It looks that we have a bug we need to solve.

Borjagodoy · 2024-08-06T10:37:10Z

I think this could be moved to a new issue @kaoutar55 , since the issue was opened because of a problem that finally seemed to be a concept bug, we can open a discussion about the perfect_bias if you like @maljoras and close this issue because actually the issue was solved, or at least that was my impression correct me if I'm wrong @Zhaoxian-Wu

kaoutar55 · 2024-09-18T15:43:11Z

@Zhaoxian-Wu please look at this and try it at your end with the suggested changes. Let us know if the issue is resolved.

Zhaoxian-Wu added the bug Something isn't working label Apr 1, 2024

maljoras self-assigned this Apr 2, 2024

Borjagodoy self-assigned this Aug 6, 2024

Borjagodoy added the status:reviewing label Aug 6, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Model Initialized outsize [w_min, w_max]: Pinpointing the bug in issue #604 #635

Model Initialized outsize [w_min, w_max]: Pinpointing the bug in issue #604 #635

Zhaoxian-Wu commented Apr 1, 2024

maljoras commented Apr 2, 2024

Zhaoxian-Wu commented Apr 2, 2024

maljoras commented Apr 2, 2024

Zhaoxian-Wu commented Apr 8, 2024

kaoutar55 commented May 8, 2024

Borjagodoy commented Aug 6, 2024

kaoutar55 commented Sep 18, 2024

Model Initialized outsize [w_min, w_max]: Pinpointing the bug in issue #604 #635

Model Initialized outsize [w_min, w_max]: Pinpointing the bug in issue #604 #635

Comments

Zhaoxian-Wu commented Apr 1, 2024

Description

Bug Pinpoint

TODO

maljoras commented Apr 2, 2024

Zhaoxian-Wu commented Apr 2, 2024

maljoras commented Apr 2, 2024

Zhaoxian-Wu commented Apr 8, 2024

kaoutar55 commented May 8, 2024

Borjagodoy commented Aug 6, 2024

kaoutar55 commented Sep 18, 2024