[OSPP23] Add llama2 lowering end2end #216
Conversation
Force-pushed from 2663d80 to ffa3527 (compare).
Test script and readme.md are in examples/MLIRLlama/.
@weilinquan Nice!
We also need to add the following items:
examples/DLModel/makefile (Outdated)
```make
	--reconcile-unrealized-casts | \
	${MLIR_TRANSLATE} \
	-mlir-to-llvmir | \
	${LLC} -mtriple=x86_64 -filetype=obj --relocation-model=pic ${OPT_FLAG} -o resnet18.o
```
This PR is for LLaMA-related implementations only; don't modify the ResNet part here.
examples/MLIRLlama/requirements.txt (Outdated)
We have added the requirements.txt at the root of our project.
https://github.com/buddy-compiler/buddy-mlir/blob/main/requirements.txt
frontend/Python/ops/tosa.py (Outdated)
Should this `tosa.py` be in Yuliang's PR?
examples/BuddyLlama/llama-main.cpp (Outdated)
```diff
   // Print the tokenized result
   cout << "Get User input:" << pureStrContainer.revert(pureStrContainer)
        << endl;
-  cout << "[Buddy] Tokenize input time: " << buddyTokenizeTime * 1000 << "ms"
+  cout << "[Buddy] Tokenize input time: " << buddyTokenizeTime.count() * 1000 << "ms"
```
What is the unit of time measurement here? The value multiplied by 1000 here is labeled “ms”, while the value divided by 1000 below is labeled “s”.
Yes! I have changed buddyTokenizeTime's time unit to milliseconds and updated this cout.
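For illustration, here is a minimal Python sketch of the unit-consistency point raised above: measure once, convert to milliseconds immediately, and never scale again at the print site. The `tokenize` helper is a hypothetical stand-in, not code from this PR.

```python
import time

def tokenize(text):
    # Hypothetical stand-in for the real tokenizer, only here so the
    # sketch runs on its own.
    return text.split()

start = time.perf_counter()                # seconds, as a float
tokens = tokenize("Hello Buddy")
tokenize_time_ms = (time.perf_counter() - start) * 1000  # convert once
print(f"[Buddy] Tokenize input time: {tokenize_time_ms:.3f}ms")
```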
Please update the README by adding the necessary steps to run the E2E example.
frontend/Python/ops/linalg.py (Outdated)
```diff
@@ -0,0 +1,2532 @@
+# ===- linalg.py -----------------------------------------------------------------
```
Wrong format. Please follow the 80-column limit.
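For reference, a header line that fits the limit could look like this (illustrative only, not the exact line that landed):

```python
# ===- linalg.py --------------------------------------------------------
```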
frontend/Python/frontend.py (Outdated)
```diff
@@ -241,7 +275,7 @@ def generated_func(*args):
     for output_arg in output_node_args:
         op = self._symbol_table.get((str(output_arg), 0))
         returns.append(op)

+    returns = returns[0]
```
Why return only the first `returns` value? This will trigger a `check-buddy` failure (the `test_var_mean` case returns two values).
In llama2, the graph returns many tensors, including intermediate tensors kept for gradient computation, but we only need the first tensor to generate the next token.
Maybe we should find a better way to solve this.
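One possible shape for such a fix, sketched here with hypothetical names rather than this PR's actual code: unwrap only when the graph has a single output, so multi-output cases like `test_var_mean` still return every value.

```python
def pack_returns(returns):
    # Hypothetical helper: unwrap single-output graphs, keep
    # multi-output graphs (e.g. test_var_mean) intact as a tuple.
    return returns[0] if len(returns) == 1 else tuple(returns)

print(pack_returns(["next_token_logits"]))   # llama2-style single output
print(pack_returns(["var", "mean"]))         # two-output test case
```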
examples/BuddyLlama/test-llama2.py (Outdated)
```python
from buddy.compiler.frontend import DynamoCompiler
from buddy.compiler.ops import tosa

tokenizer = LlamaTokenizer.from_pretrained('/llama-2-7B-hf')
```
The `/llama-2-7B-hf` path seems to cause an error:

```
huggingface_hub.utils._validators.HFValidationError:
Repo id must use alphanumeric chars or '-', '_', '.', '--' and '..' are forbidden,
'-' and '.' cannot start or end the name, max length is 96: '/llama-2-7B-hf'.
```

Did I miss something important, or should we remove the `/`?
Ah, I recalled the details about the configurations here. We should modify the path to the huggingface version of the llama model, right?
Yes
I will change it from '/llama-2-7B-hf' to 'path to huggingface llama2 model'.
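A sketch of what the configurable version might look like (the environment variable name and fallback string are assumptions, not this PR's code):

```python
import os
from transformers import LlamaTokenizer

# Hypothetical: take the local checkpoint directory from an environment
# variable instead of hard-coding '/llama-2-7B-hf'. A nonexistent local
# path falls through to Hugging Face repo-id validation, which rejects
# the leading '/'.
model_path = os.environ.get("LLAMA_MODEL_PATH", "path/to/llama-2-7b-hf")
tokenizer = LlamaTokenizer.from_pretrained(model_path)
```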
LGTM! Congrats 🎉🎉🎉
Thank you for your contribution. Hope you enjoyed the OSPP project!