
Perform read lock on LSP requests #1640

Open · wants to merge 2 commits into main
Conversation

@msujew (Member) commented Aug 21, 2024

I've noticed that some long-running requests (i.e. completion in large files, sometimes semantic highlighting) can break pretty heavily and lead to unresolved references if the specified document gets a new build triggered. This change adds a new type of read to the WorkspaceLock service that allows a read request to be served instantly - all other requests are queued up until this read request has finished (or has been aborted).
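
For illustration, a minimal usage sketch of what this enables (the service access path `services.shared.workspace.WorkspaceLock` and the completion call are illustrative, not taken from this diff):

const lock = services.shared.workspace.WorkspaceLock;

// A prioritized read is served right away, even while a rebuild is queued;
// all other reads and writes wait until it has resolved or has been aborted.
const completions = await lock.read(
    () => completionProvider.getCompletion(document, params),
    true // the new priority flag introduced by this PR
);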

msujew added the LSP (Language Server Protocol integration) label on Aug 21, 2024
msujew added this to the v3.2.0 milestone on Aug 22, 2024
@dhuebner (Contributor) commented:

As I understood from the discussion in the weekly dev meeting, the problem is the following:
We have a running read operation (e.g. semantic highlighting) and a new write operation request (e.g. document changed) arrives. The write operation resets the document while the read operation still operates on the old document reference. This seems to break the document, and the running read operation may fail as well?
Introducing a priority read doesn't feel right to me, as some read operations then take priority over write operations. It also makes the locking logic more complicated and difficult to understand.

Maybe the solution in this PR is the only one, but I still want to share some other ideas (not sure they work well with Langium):

  • Would creating a new Document instead of resetting the existing one work? That way, the running read operation could finish its work on the old, outdated document and nothing should break.
  • One could also actively trigger cancellation of all running read operations when a write arrives and wait until they are cancelled; only then apply the write operation (see the sketch below).
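
A rough sketch of the second idea (purely illustrative; the class and its bookkeeping are invented for this example and are not Langium API):

import { CancellationToken, CancellationTokenSource } from 'vscode-languageserver';

class CancellingReadLock {
    // Each in-flight read is tracked together with its cancellation source.
    private readonly runningReads = new Map<Promise<unknown>, CancellationTokenSource>();

    read<T>(action: (token: CancellationToken) => Promise<T>): Promise<T> {
        const source = new CancellationTokenSource();
        const promise = action(source.token);
        this.runningReads.set(promise, source);
        return promise.finally(() => this.runningReads.delete(promise));
    }

    async write<T>(action: () => Promise<T>): Promise<T> {
        // Signal cancellation to every in-flight read...
        for (const source of this.runningReads.values()) {
            source.cancel();
        }
        // ...wait until they have actually unwound (successfully or not)...
        await Promise.allSettled([...this.runningReads.keys()]);
        // ...and only then apply the write operation.
        return action();
    }
}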

@Lotes (Contributor) left a comment

Some small things I noticed

     */
-    read<T>(action: () => MaybePromise<T>): Promise<T>;
+    read<T>(action: () => MaybePromise<T>, priority?: boolean): Promise<T>;
A contributor commented:

I am associating priority with an ordering/number. I understand the usage here as hasPriority. Maybe there is a better name, like: isUrgent, important, handleFirst...

A contributor commented:

IMO a Boolean argument is problematic because seeing a method call like read(..., true) hides the semantics behind it. I'd use a string value like 'normal' | 'prioritized' instead.
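
For example, the declaration could look roughly like this (the ReadPriority name and the call site are only a suggestion, mirroring the interface above):

export type ReadPriority = 'normal' | 'prioritized';

export interface WorkspaceLock {
    read<T>(action: () => MaybePromise<T>, priority?: ReadPriority): Promise<T>;
}

// A call site then documents itself:
// lock.read(() => provideSemanticTokens(document, params), 'prioritized');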

@@ -59,8 +64,25 @@ export class DefaultWorkspaceLock implements WorkspaceLock {
         return this.enqueue(this.writeQueue, action, tokenSource.token);
     }

-    read<T>(action: () => MaybePromise<T>): Promise<T> {
-        return this.enqueue(this.readQueue, action);
+    read<T>(action: () => MaybePromise<T>, priority?: boolean): Promise<T> {
A contributor commented:

Just thinking out loud: is this the first time we have a "priority" queue?
If not, you could refactor it into a common component/function.

@msujew (Member, Author) replied:

The workspace lock is a mutex, not a queue. It features some very specific semantics about read/read with prio/write actions that are likely not relevant for a common component.

@spoenemann (Contributor) left a comment

I share @dhuebner's concerns. Is it right to push through an LSP request even though the text has changed in the meantime?

Reading the specification section Implementation Considerations, it looks like the best solution would be to return an error code ContentModified when a change has happened.

-    return await serviceCall(language, document, params, cancelToken);
+    const result = await lock.read(async () => {
+        return await serviceCall(language, document, params, cancelToken);
+    }, true); // Give this priority, since we already waited until the target state
A contributor commented:

Almost all LSP requests are now treated with priority? That seems too much to me: especially implicitly sent requests like DocumentHighlights should not block the build process. And what about Completion requests that are sent while typing?

@msujew (Member, Author) replied:

All of these are getting cancelled by the language client - they shouldn't be blocking the workspace in any way for longer than our timeout (5ms).

A contributor commented:

Where does that 5 ms timeout come from?

@msujew (Member, Author) commented Aug 29, 2024

@spoenemann The link you've posted literally says this about changed content:

servers should therefore not decide by themselves to cancel requests simply due to that fact that a state change notification is detected in the queue. As said the result could still be useful for the client.

We do exactly this: we prevent the response from failing by blocking the workspace lock and returning the result. In the meantime, the client is free to cancel the pending request, which we fully respect by unblocking the workspace lock.

@spoenemann (Contributor) commented Aug 29, 2024

But that section also says:

if a server detects an internal state change (for example, a project context changed) that invalidates the result of a request in execution the server can error these requests with ContentModified

This PR is about long running requests; when the state of a document is changed so that we can no longer finish the request, we should return an error. Note that this is not the same as canceling the request (which should be done only by the client).

@spoenemann (Contributor) commented:

Could it help to use a utility function like this in LSP request processing?

import { LangiumDocument, interruptAndCheck } from 'langium';
import { CancellationToken, LSPErrorCodes, ResponseError } from 'vscode-languageserver';

export async function interruptAndCheckDocument(document: LangiumDocument, token: CancellationToken): Promise<void> {
    const previousState = document.state;
    await interruptAndCheck(token);
    if (document.state < previousState) {
        throw new ResponseError(LSPErrorCodes.ContentModified, 'Document content has been modified.');
    }
}

@msujew (Member, Author) commented Aug 29, 2024

This PR is about long running requests; when the state of a document is changed so that we can no longer finish the request

The issue is: we cannot really know. For example, for the document highlight:

  1. User edits text on line 2 and the language client requests a highlighting delta
  2. Server is computing delta request for line 2
  3. Meanwhile, user edits text on line 10 and the language client requests a highlighting delta for that line
  4. ???

Aborting the initial operation for line 2 when receiving a document update is not the correct move here - the data for that highlighting is still valid and aborting the request might leave the user with incorrectly highlighted text.

This is the case for most LSP requests. The internal state changes the documentation is talking about (like project context switches) are much more fundamental than "normal" document changes; they make the result literally useless, whereas our results are probably still useful (unless the language client deems the change too large, in which case we get a cancellation anyway).

msujew removed this from the v3.2.0 milestone on Aug 29, 2024
@spoenemann (Contributor) left a comment

Yes, I see the problem with semantic highlighting (Semantic Tokens) (you said document highlighting, but that's a different service). The situation is different for other services. But it's true that it's the client that should decide whether to use a potentially outdated result or not.

We could apply this and then check how it affects the editing experience in larger projects. I'm particularly curious how it performs when completion is continuously triggered while typing.

@@ -89,7 +111,7 @@ export class DefaultWorkspaceLock implements WorkspaceLock {
         } else {
             return;
         }
         this.done = false;
         this.counter += entries.length;
         await Promise.all(entries.map(async ({ action, deferred, cancellationToken }) => {
             try {
                 // Move the execution of the action to the next event loop tick via `Promise.resolve()`
A contributor commented:

Unrelated to this change, but does awaiting Promise.resolve() really change the execution order in the event loop? Isn't that rather what setImmediate and our utility function delayNextTick are designed for?

We could rewrite this to:

await delayNextTick();
const result = await action(cancellationToken);
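
For reference, a small standalone illustration of the difference (assuming delayNextTick wraps setImmediate, as its name suggests):

// In Node, `await Promise.resolve()` only yields to the microtask queue: the awaiting
// code resumes before any pending I/O or `setImmediate` callbacks. `delayNextTick`
// (sketched here) defers to the check phase of a later event loop iteration.
function delayNextTick(): Promise<void> {
    return new Promise(resolve => setImmediate(() => resolve()));
}

async function viaMicrotask(): Promise<void> {
    await Promise.resolve();   // resumes via the microtask queue, still in the current tick
    console.log('after microtask');
}

async function viaNextTick(): Promise<void> {
    await delayNextTick();     // resumes in a later event loop iteration
    console.log('after next tick');
}

setImmediate(() => console.log('check-phase callback'));
void viaMicrotask();           // logs before the check-phase callback
void viaNextTick();            // logs after the check-phase callback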

Promise.resolve(action()).then(
    result => end(() => deferred.resolve(result)),
    err => end(() => deferred.reject(err))
);
A contributor commented:

I see two potential issues with this code:

  • The solution with the local end function is a more complicated way of expressing a finally block.
  • I suspect that Promise.resolve(action()) does not behave as we want when the action throws an error – the error would just be propagated instead of rejecting the resulting promise.

I suggest making this whole method async so we can await the action and wrap that in a try-finally block.
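
A sketch of that shape, assuming `end` only performs the remaining queue bookkeeping (its body is not visible in this diff); `action`, `deferred`, `cancellationToken` and `delayNextTick` come from the surrounding method:

try {
    // Defer the action to a later point in the event loop, then await it directly,
    // so both synchronous throws and rejected promises land in the catch block.
    await delayNextTick();
    const result = await action(cancellationToken);
    deferred.resolve(result);
} catch (err) {
    deferred.reject(err);
} finally {
    end(); // assumed to perform only the bookkeeping; resolve/reject moved above
}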

-    return await serviceCall(language, params, cancelToken);
+    const result = await lock.read(async () => {
+        return await serviceCall(language, params, cancelToken);
+    }, true); // Give this priority, since we already waited until the target state
A contributor commented:

I think this can be simplified to

await lock.read(() => serviceCall(language, params, cancelToken), true)

(same in createServerRequestHandler and createRequestHandler).

@spoenemann (Contributor) commented:

Question: with this change, does it make any sense for LSP services to use interruptAndCheck? I see it currently used by inlay-hint-provider, semantic-token-provider and workspace-symbol-provider.

Should we make it clear in the function documentation that it should be used only in the document building process and the associated services?
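
For context, a rough sketch of how those providers typically use interruptAndCheck (computeResultFor is a placeholder for the provider-specific work):

async function provide(document: LangiumDocument, cancelToken: CancellationToken): Promise<void> {
    for (const node of streamAllContents(document.parseResult.value)) {
        // Periodically yields to the event loop and throws `OperationCancelled`
        // once the client has cancelled the request.
        await interruptAndCheck(cancelToken);
        computeResultFor(node); // placeholder
    }
}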
