feature request: ignoring specified template #716

Closed
codingCoffee opened this issue Mar 8, 2022 · 4 comments
Labels
enhancement New feature or request

Comments

@codingCoffee

This is more of a feature request than an issue. Let me preface it by saying Dolos is really awesome. It is really difficult to find open source plagiarism detection software, especially one that compiles a web-based report. I wanted to pitch a feature request for the Dolos CLI: the ability to specify a predeclared code template that the plagiarism checker ignores. The use case is assignments where a code template is provided and students only add the most important logical section of the assignment to it. The code template (which should be ignored) usually consists of trivial components like class and function definitions.
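
To make the request concrete, here is a minimal Python sketch of the idea, not Dolos' actual implementation. It assumes fingerprints are simply k-grams of whitespace-separated tokens; the helper names `kgram_fingerprints` and `strip_template` are hypothetical and exist only for illustration.

```python
def kgram_fingerprints(source, k=3):
    """Return the set of k-grams of whitespace-separated tokens in `source`.

    Assumption: a real tool would tokenize with a parser and hash the
    k-grams; plain tuples keep this sketch short and deterministic.
    """
    tokens = source.split()
    return {tuple(tokens[i:i + k]) for i in range(len(tokens) - k + 1)}


def strip_template(submissions, template, k=3):
    """Drop every fingerprint that also occurs in the template.

    `submissions` maps a file name to its source text. The result maps
    each file name to the fingerprints left after removing template code.
    """
    template_fps = kgram_fingerprints(template, k)
    return {
        name: kgram_fingerprints(source, k) - template_fps
        for name, source in submissions.items()
    }


template = "def solve ( xs ) :"
submissions = {
    "alice.py": "def solve ( xs ) : return sum ( xs )",
    "bob.py": "def solve ( xs ) : return max ( xs )",
}
remaining = strip_template(submissions, template)
```

Only the fingerprints that students wrote themselves (or that straddle the boundary between template and student code) remain in `remaining`, so two submissions no longer match merely because both start from the same skeleton.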

@rien
Member

rien commented Mar 8, 2022

This is completely doable and I seem to remember that this was implemented at some point before a major rewrite. We will consider adding this again in the future.

However, you should currently be able to use the -m and -M options to ignore fingerprints that occur in a lot of files:

The -m option sets the maximum number of times a given fingerprint may appear before it is ignored. A code fragment that appears in many programs is probably legitimate sharing and not the result of plagiarism. With -m N any fingerprint appearing in more than N programs is filtered out. This option has precedence over the -M option, which is set to 0.9 by default.

The -M option sets the maximum fraction of the files a fingerprint may appear in before it is ignored. With -M N any fingerprint appearing in more than a fraction N of the files is filtered out (for example, 0.9 means 90% of the files). Must be a value between 0 and 1. This option is ignored when comparing only two files, because each match appears in 100% of the files.
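
The filtering described by these two options can be sketched roughly as follows. This is an illustration written for this thread, not the code Dolos runs: the function name `filter_common_fingerprints` and its parameters `max_count` and `max_fraction` are hypothetical stand-ins for -m and -M, and fingerprints are assumed to be plain hashable values.

```python
from collections import defaultdict


def filter_common_fingerprints(files, max_count=None, max_fraction=0.9):
    """Remove fingerprints that occur in too many files.

    `files` maps a file name to the set of fingerprints found in it.
    `max_count` plays the role of -m and `max_fraction` the role of -M.
    """
    # Count in how many files each fingerprint occurs.
    occurrences = defaultdict(int)
    for fingerprints in files.values():
        for fp in fingerprints:
            occurrences[fp] += 1

    if max_count is not None:
        # An absolute count takes precedence over the fraction.
        limit = max_count
    elif len(files) > 2:
        limit = max_fraction * len(files)
    else:
        # With only two files every match is in 100% of the files,
        # so the fraction is ignored and nothing is filtered.
        limit = len(files)

    return {
        name: {fp for fp in fingerprints if occurrences[fp] <= limit}
        for name, fingerprints in files.items()
    }


files = {
    "a.py": {1, 2, 3},
    "b.py": {1, 2, 4},
    "c.py": {1, 5},
    "d.py": {1, 2},
}
by_fraction = filter_common_fingerprints(files)             # like -M 0.9
by_count = filter_common_fingerprints(files, max_count=2)   # like -m 2
```

With the default fraction, only fingerprint `1` (present in all four files) is dropped; with `max_count=2`, fingerprint `2` (present in three files) is dropped as well. Note that this filter cannot tell template code apart from any other widely shared fragment.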

rien added the enhancement label on Nov 9, 2022

@tsieger

tsieger commented Jan 25, 2023

I would like to vote for this feature request. It would be awesome to have a direct way to ignore teachers' code templates! (BTW, using -m and -M would also filter out shared code chunks that were not part of the template.)

@edeitrick

Codio users are also requesting this

@rien
Member

rien commented May 31, 2024

We have added experimental support for ignoring template code and frequent fingerprints in PR #1524.

rien closed this as completed on May 31, 2024

4 participants