Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add Khiops parameters to predictor constructors #71

Open
popescu-v opened this issue Jul 4, 2023 · 2 comments · May be fixed by #242
Open

Add Khiops parameters to predictor constructors #71

popescu-v opened this issue Jul 4, 2023 · 2 comments · May be fixed by #242
Assignees
Labels
Priority/0-High To do now Status/InDevelopment The issue is in development by one or more team members Type/Feature A new feature request or an improvement of a feature
Milestone

Comments

@popescu-v
Copy link
Collaborator

popescu-v commented Jul 4, 2023

Estimators of sklearn module (KhiopsClassifier, KhiopsRegressor and KhiopsEncoder) do not support all parameters that are available in the train_ functions of core module. The objective is therefore to extend their constructors by introducing the missing parameters.

  1. For KhiopsClassifier and KhiopsRegressor, the following parameters are going to be added:
  • max_evaluated_variables : int, default 0
    Maximum number of variables to be evaluated in the SNB predictor training. If equal to 0 it evaluates all informative variables.
  • max_selected_variables : int, default 0
    Maximum number of variables to be selected in the SNB predictor. If equal to 0 it selects all the variables kept in the
    training.
  • construction_rules : list of str, optional
    Allowed rules for the automatic variable construction. If not set it uses all possible rules.
  • specific_pairs : list of tuple, optional
    User-specified pairs as a list of 2-tuples of variable names. If a given tuple contains only one non-empty string generated
    within the maximum limit max_pairs.
  • all_possible_pairs : bool, default True
    If True tries to create all possible pairs within the limit max_pairs.
    The pairs and variables given in specific_pairs have priority.
  • group_target_value : bool, default False
    Allows grouping of the target variable values in classification. It can substantially increase the training time.
  1. For KhiopsEncoder, the following parameters are going to be added:
  • construction_rules : list of str, optional
    Allowed rules for the automatic variable construction. If not set it uses all possible rules.
  • specific_pairs : list of tuple, optional
    User-specified pairs as a list of 2-tuples of variable names. If a given tuple contains only one non-empty string generated
    within the maximum limit max_pairs.
  • all_possible_pairs : bool, default True
    If True tries to create all possible pairs within the limit max_pairs. The pairs and variables given in specific_pairs have priority.
  • group_target_value : bool, default False
    Allows grouping of the target variable values in classification. It can substantially increase the training time.
  • informative_variables_only : bool, default True
    If True keeps only informative variables.
  • pairs_recoding_method : str
    Type of recoding for bivariate variables. Types available:
    - "part Id" (default): An id for the interval/group
    - "part label": A label for the interval/group
    - "0-1 binarization": A 0's and 1's coding the interval/group id
    - "conditional info": Conditional information of the interval/group
    - "none": Keeps the variable as-is
@popescu-v popescu-v added Status/Draft The issue is still not well defined Type/Feature A new feature request or an improvement of a feature labels Jul 4, 2023
@popescu-v
Copy link
Collaborator Author

  • only few parameters are shown to the user compared to core.

1 similar comment
@popescu-v
Copy link
Collaborator Author

  • only few parameters are shown to the user compared to core.

@folmos-at-orange folmos-at-orange added this to the 11.0.0 milestone Sep 22, 2023
@folmos-at-orange folmos-at-orange added Status/InDevelopment The issue is in development by one or more team members and removed Status/Draft The issue is still not well defined labels Sep 10, 2024
@nairbenrekia nairbenrekia linked a pull request Sep 10, 2024 that will close this issue
@popescu-v popescu-v modified the milestones: 11.0.0.0, 10.2.2.5 Sep 24, 2024
@popescu-v popescu-v added this to the 10.2.3.1 milestone Oct 17, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Priority/0-High To do now Status/InDevelopment The issue is in development by one or more team members Type/Feature A new feature request or an improvement of a feature
Projects
None yet
Development

Successfully merging a pull request may close this issue.

3 participants