Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Provide a normalized algorithm for compute lexical similar score #596

Open
wants to merge 1 commit into
base: master
Choose a base branch
from

Conversation

IcyTide
Copy link

@IcyTide IcyTide commented Mar 22, 2024

Same sentences can always get a "1" simirlar score like dense way but not a score less than 1 and change with different sentence content.
Different sentences can get an more even similar score distribution.

same sentences can always get a "1" simirlar score like dense way but not a score less than 1 and change with different sentence content. 
different sentences can get an even similar score distribution.
@IcyTide
Copy link
Author

IcyTide commented Mar 22, 2024

Same sentences results:
screenshots

Different sentences results:
screenshots1

@staoxiao
Copy link
Collaborator

Thanks for your contribution!
This method may change the ranking list, so we need some time to conduct experiments to evaluate its performance.

@IcyTide
Copy link
Author

IcyTide commented Mar 25, 2024

Thanks for your contribution! This method may change the ranking list, so we need some time to conduct experiments to evaluate its performance.

This approach might be more explainable for applications compared to the original method, therefore it could perhaps be considered as an additional method, but not a replacement for the original one (depending on the results of your experiments).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants