feat: change oneccl #12296

cranechu0131 · 2024-10-30T02:24:55Z

feat: change oneccl

hkvision · 2024-10-30T02:35:13Z

python/llm/example/GPU/Deepspeed-AutoTP/deepspeed_autotp.py

@@ -135,7 +136,7 @@ def get_int_from_env(env_keys, default):
            actual_output_len = output.shape[1] - input_ids.shape[1]
            output_str = tokenizer.decode(output[0], skip_special_tokens=True)
            avg_time = (end - st) / actual_output_len * 1000
-            print(f'Inference time of generating {actual_output_len} tokens: {end-st} s, average token latency is {avg_time} ms/token.')
+            print(f'Inference time of generating {actual_output_len} tokens: {end-st} s,first token cost {model.first_cost} s, rest tokens average cost {model.rest_cost_mean} s')


space before ,

plusbang

LGTM

cranechu0131 added 6 commits October 30, 2024 10:24

feat: change oneccl

2d965cd

fix: restore llama-70b

d11d88c

fix: remove tab

0b2029a

fix: remove extra blank

46d9d51

small fix

fe2e1a9

add comments

5ff4e23

hkvision reviewed Oct 30, 2024

View reviewed changes

hkvision approved these changes Oct 30, 2024

View reviewed changes

hkvision requested a review from plusbang October 30, 2024 02:36

plusbang approved these changes Oct 30, 2024

View reviewed changes

cranechu0131 added 2 commits October 31, 2024 09:46

fix: add a blank space

303d5c8

Merge branch 'main' into change_oneccl

a71f215

hkvision merged commit 29400e2 into intel-analytics:main Oct 31, 2024
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: change oneccl #12296

feat: change oneccl #12296

cranechu0131 commented Oct 30, 2024

hkvision Oct 30, 2024

plusbang left a comment

feat: change oneccl #12296

feat: change oneccl #12296

Conversation

cranechu0131 commented Oct 30, 2024

hkvision Oct 30, 2024

Choose a reason for hiding this comment

plusbang left a comment

Choose a reason for hiding this comment