Correctly format followup messages in turn-based (chat) inference #299
Conversation
Overall this looks good. Thank you for the contribution!
@@ -159,15 +159,15 @@ public async IAsyncEnumerable<string> ChatAsync(string prompt, IInferenceParams?
InteractiveExecutorState state = (InteractiveExecutorState)executor.GetStateData();
prompt = state.IsPromptRun
    ? HistoryTransform.HistoryToText(History)
There's an unexpected behaviour here. When the user inputs the prompt for the first run, they want it treated as raw text rather than transformed text. You could run the ChatSessionWithRoleName example and set a breakpoint to check it.
It's not your fault; the abstraction here makes the boundary vague. Some of the logic in the executors and chat session needs to be refactored to fix this completely. Still, a quick fix for now would be better. :)
Yeah, I see what you're saying. Do you think the change to the overlay that uses the ChatHistory is OK to go in by itself? The followup messages a user sends need the template from the IHistoryTransform implementation applied so that the model has the appropriate tokens to continue the conversation.
I think it's better to check whether the history is empty before processing the prompt here. If the history is empty, the current prompt is the first input, so we don't modify it; otherwise we can resume from history.
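The empty-history check described above can be sketched as follows. This is a language-neutral illustration of the decision logic, not LLamaSharp's actual API; `prepare_prompt` and `history_to_text` are hypothetical names.

```python
# Hypothetical sketch: decide whether to transform the prompt based on
# whether any chat history exists yet (names are illustrative only).
def prepare_prompt(prompt, history, history_to_text):
    if not history:
        # First input of the session: pass the user's prompt through as raw text.
        return prompt
    # Resuming a session: rebuild the full conversation text from the history.
    return history_to_text(history)

# First run: the prompt is left untouched.
first = prepare_prompt("Hello", [], lambda h: "\n".join(h))
# Later run: the prompt is reconstructed from the stored history.
resumed = prepare_prompt("Hello", ["<|user|> Hi", "<|assistant|> Hey"],
                         lambda h: "\n".join(h))
```

The key design point is that the branch depends only on whether history exists, so the first prompt never gets the turn template applied twice.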
If a session was loaded, wouldn't the history be duplicated in the state? I think this problem arises because the actual text is not saved when a session is saved.
I'm working on a proposal for the ChatSession class that will communicate the behavior of the class better through its interface, and will have more guardrails in place so it can't be used in unexpected ways. I'll send that in as a separate PR and close this one. I don't think we'll get this right without breaking changes.
Actually, we may also refactor some parts soon. If you'd like to make some aggressive breaking changes, you could open a PR against the preview branch instead, so that you don't feel limited. :)
While working with a different model than in my previous tests, I noticed that it always started its responses to the second, third, etc. question with the token that indicates the start of a turn (e.g. <|assistant|>).
I investigated and found that sending in the raw text is not enough for this model, as it happened to be for the model I initially tested with.
I changed the code so that the correct turn template, as defined by the HistoryTransform class, is applied before the prompt is sent into the InternalChatAsync method. This resolved the issue, and turn-based inference started working as expected.
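To make the failure mode concrete, here is a hedged, language-neutral sketch of why followup messages need the turn template. The marker strings and the `format_turn` helper are hypothetical; they only illustrate the pattern, not LLamaSharp's HistoryTransform implementation.

```python
# Hypothetical turn-template helper (illustrative names, not LLamaSharp's API).
# Chat-tuned models expect each message wrapped in turn markers; if a followup
# message is sent as raw text, the model may emit the missing marker itself
# (e.g. starting its reply with "<|assistant|>").
def format_turn(role, message):
    # Wrap the message in the role's start marker and cue the assistant's turn.
    return f"<|{role}|>\n{message}\n<|assistant|>\n"

raw_followup = "How are you?"            # what the buggy path sent
formatted = format_turn("user", raw_followup)  # what the fix sends
```

With the template applied, the model sees the turn boundary in its input and no longer needs to emit the start-of-turn token in its output.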