FEAT: Support 1.58-bit LLMs training #114

Open
younesbelkada opened this issue Mar 21, 2024 · 1 comment
Labels: enhancement (New feature or request), good first issue (Good for newcomers), help wanted (Extra attention is needed), Low Priority

Comments

younesbelkada commented Mar 21, 2024

Hi there!

Microsoft have just released the full handbook for reproducing the 1-bit LLM paper: https://github.com/microsoft/unilm/blob/master/bitnet/The-Era-of-1-bit-LLMs__Training_Tips_Code_FAQ.pdf

It would be exciting to have an official implementation of that paper in nanotron, and to support 1-bit LLM inference directly in transformers for models trained with that method using nanotron.
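For readers unfamiliar with the method, here is a minimal sketch of the kind of ternary weight quantization the "1.58-bit" name refers to: each weight is mapped to one of {-1, 0, +1} using a single per-tensor scale (the mean absolute value). This is an illustration under stated assumptions, not nanotron or transformers code; the function names and the `eps` default are hypothetical, and the linked handbook remains the reference for the actual training recipe.

```python
import math


def absmean_ternary_quantize(weights, eps=1e-5):
    """Quantize a flat list of floats to {-1, 0, +1} with one shared scale.

    Hypothetical helper for illustration only; `eps` guards against a
    zero scale and its default value is an assumption.
    """
    # The scale is the mean absolute value of the weights.
    scale = max(sum(abs(w) for w in weights) / len(weights), eps)
    # Round each rescaled weight to the nearest integer, then clip to [-1, 1].
    ternary = [max(-1, min(1, round(w / scale))) for w in weights]
    return ternary, scale


def dequantize(ternary, scale):
    """Map ternary codes back to approximate float weights."""
    return [t * scale for t in ternary]


# Three possible states per weight carry log2(3) bits, hence "1.58-bit".
BITS_PER_WEIGHT = math.log2(3)

if __name__ == "__main__":
    codes, scale = absmean_ternary_quantize([0.9, -0.05, -1.2, 0.4])
    print(codes, scale, dequantize(codes, scale), BITS_PER_WEIGHT)
```

During training such a scheme is typically paired with full-precision latent weights and a straight-through estimator for the rounding step; that part is omitted here.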

cc @NouamaneTazi @xrsrke @3outeille @thomwolf

cc original author:
@shumingma

younesbelkada changed the title from "FEAT: Support 1.58-bit LLMs" to "FEAT: Support 1.58-bit LLMs training" on Mar 21, 2024

xrsrke (Member) commented Mar 25, 2024

@younesbelkada, hey, thanks for the suggestion. I've talked with @NouamaneTazi, and we agree that we will add 1-bit support later on, for consumer hardware, because FP8 is the more compelling option right now. With FP8 you get a training speedup (FP8 matmul, which is very important) and a memory reduction, and it has been tested at scale (180B). So, currently, we are focusing on FP8 :)

xrsrke added the enhancement, help wanted, Low Priority, and Medium Priority labels and removed the Low Priority label on Mar 25, 2024
xrsrke added the good first issue and Low Priority labels and removed the Medium Priority label on Apr 3, 2024