Commits · 87fb436ccf50ef1b76d5e74a07d85422eb000349 · Luc Giffon / bolsonaro

Mar 25, 2020

Fix flw_pairs loading. Prepare all new exps: omp_distillation, preds... · 87fb436c

Charly Lamothe authored 4 years ago

Fix flw_pairs loading. Prepare all new exps: omp_distillation, preds coherence, preds correlation, normalize_D when OMP, n_jobs=-1 in SOTA. In exps script, test both train+dev,train+dev and train,dev

87fb436c

Mar 24, 2020
- fix and optimise ensemble selection forest regressor · 8d52496f
  Luc Giffon authored 4 years ago
  
  8d52496f
- Handle similarity_similarities and similarity_predictions in the pipeline and... · 5ee9422b
  Charly Lamothe authored 4 years ago
  
  Handle similarity_similarities and similarity_predictions in the pipeline and set lfw_pairs to binary classif (todo: change the labels for omp)
  5ee9422b
Mar 13, 2020
- Merge from master · 4d4c0848
  Charly Lamothe authored 5 years ago
  
  4d4c0848
Mar 12, 2020
- Fix on resume mode for extracted forest size job · 9d7ef0e7
  Charly Lamothe authored 5 years ago
  
  9d7ef0e7
Mar 06, 2020
- Correction on random extraction · 6483c0dc
  Léo Bouscarrat authored 5 years ago
  
  6483c0dc
- Update fix random strategy (wip · 138660cb
  Charly Lamothe authored 5 years ago
  
  138660cb
- Fix hyperparams bugs in base and random. Fix extracted forest size used in... · 2a265135
  Charly Lamothe authored 5 years ago
  
  Fix hyperparams bugs in base and random. Fix extracted forest size used in random. Factorize random fitting
  2a265135
- Integrate Paolo's code of method 'Ensemble selection from libraries of models'... · 1194ee2f
  Charly Lamothe authored 5 years ago
  
  Integrate Paolo's code of method 'Ensemble selection from libraries of models' by Rich Caruana et al
  1194ee2f
- Speedup similarity forest regressor and add parallelization at the extracted... · 29a11860
  Charly Lamothe authored 5 years ago
  
  Speedup similarity forest regressor and add parallelization at the extracted forest size level in the training
  29a11860
- Fix parallelization and estimator default hyperparams in kmeans and similarity... · 0a97ff64
  Charly Lamothe authored 5 years ago
  
  Fix parallelization and estimator default hyperparams in kmeans and similarity methods. Fix on resume mode in train.py. Fix stage5 saving (tmp) in compute_results.py
  0a97ff64
- Add on resume mode for the experiment training (and set the overwrite of the... · be5bc24a
  Charly Lamothe authored 5 years ago
  
  Add on resume mode for the experiment training (and set the overwrite of the resulting model of the experiment optional)
  be5bc24a
Mar 05, 2020
- Changing endpoint linspace · 006b001e
  Léo Bouscarrat authored 5 years ago
  
  006b001e
Feb 28, 2020
- Finish to add similarity method to the pipeline. Add kmeans pruning method · 59e65276
  Charly Lamothe authored 5 years ago
  
  59e65276
Feb 04, 2020

Add Paolo's first implementation of this paper:... · c80ddd61

Charly Lamothe authored 5 years ago

Add Paolo's first implementation of this paper: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2822360/

c80ddd61

Jan 09, 2020
- - Add a temp fix for the subset used in base and random strategies; · 21ccc627
  Charly Lamothe authored 5 years ago
  
  - Add new results for stage4.
  21ccc627
Jan 08, 2020
- - Add diamonds results; · 9ce94798
  Charly Lamothe authored 5 years ago
  
  - Add stage4 (results and experiments); - Do not save model object.
  9ce94798
Dec 29, 2019

- Remove extracted_forest_sizes_number parameter from compute_results.py and... · 559d73c0

Charly Lamothe authored 5 years ago

- Remove extracted_forest_sizes_number parameter from compute_results.py and retreive the value instead;
- Add almost all remaining experiment config files of stages 1, 2 and 3;
- Add almost all remaining result plots of stages 1, 2 and 3;
- Add some temporary scripts to run all stages experiments.

559d73c0

Dec 26, 2019

- Add code for stages 2 and 3 results; · 8de5e96a

Charly Lamothe authored 5 years ago

- Add command lines example for stage 3;
- Add experiment_id option that is useful sometimes;
- Fix subsets_used param;
- Remove experiment_id in config experiment file names;
- Add config experiment files for stages 2 and 3;
- Add results for stages 2 and 3 (california_housing).

8de5e96a

- Add command lines for stage2 experiments; · 58061ea4

Charly Lamothe authored 5 years ago

- Fix possible issues for extracted forest sizes computation: around to reduce possible zeroes and remove duplicates;
- Create output experiment stage dir if not exists;
- Add base_score_metric to model raw results class;
- Add best params for lfw_pairs (maybe try with a larger number of random seeds since the score is not that high).

58061ea4

Dec 20, 2019

- Unignore results; · 51ba8a0e

Charly Lamothe authored 5 years ago

- Even if hyperparameters file is ignore with skip_best_hyperparams option, still use the same forest_size to be comparable;
- Update experiment files for stage1 wo_param experiments (using the same forest size as the with_params experiments);
- In compute_results: remove useless folder creation; temporary add extracted_forest_sizes_number option to specify the extracted forest sizes number; temporary not plotting train and dev losses in stage1 loss values figure;
- In plotter, clean-up stage1 figure generation;
- Add first unbiased losses plot (stage1: best params vs default params in california housing dataset).

51ba8a0e

Dec 19, 2019

- Finish first stable version of stage1 plotting; · 649b4e64
Charly Lamothe authored 5 years ago
```
- Fix some variable names;
- Add exp files of stage1 for california housing
```
649b4e64

- Reduce the extracted forest sizes upper bound and number because OMP seems... · 6a6cf747

Charly Lamothe authored 5 years ago

- Reduce the extracted forest sizes upper bound and number because OMP seems to converge only with small forest sizes;
- Add extraction_strategy parameter in order to save base forest and the forests trained with the same size as the extracted forest sizes used in the experiment that used OMP.

6a6cf747

Dec 18, 2019

POC of possible wrong way to compute best hyperparams. Are there the best only... · fd6dbc7b
Charly Lamothe authored 5 years ago
```
POC of possible wrong way to compute best hyperparams. Are there the best only before the application of OMP extraction?
```
fd6dbc7b

- Add an option to not use the best hyperparameters file; · 880ff78f

Charly Lamothe authored 5 years ago

- Definitely use the correct forest size (either the one from best hyperparameters or the one specified in parameter);
- Use a number of extracted forest sizes proportional as the forest size instead of fixed forest size;
- Add an option to save the current command line name instead of using the unamed directory;
- Add new california housing dataset best hyperparameters, and convert all value types that are number from string to int/float in other best hyperparameter files;
- Remove useless code from compute_results.py in prevision of the changes;
- Before best hyperparameters saving, save number as int or float instead of string;
- Add job_number option for parallelisation in both train.py and compute_hyperparameters.py scripts;
- Clean-up TODO list.

880ff78f

- Replace the futures concurrence by joblib Parallel (and add optional tqdm progress bar); · 11017545
Charly Lamothe authored 5 years ago
```
- Add new best params for 7 datasets.
```
11017545

Dec 01, 2019

- Add standard dataset scaling for dataset normalization; · f61314f3

Charly Lamothe authored 5 years ago

- Ignore unamed experiment configuration file backups;
- Factorize default dataset loading parameters;
- Add missing return_X_y in basic dataset loaders.

f61314f3

Nov 22, 2019
- - Split train function in three distinct functions; · a0f7c96f
  Charly Lamothe authored 5 years ago
  
  - Update TODO list.
  a0f7c96f
- When training, look if there is bayesian search results, if yes use this.... · c66d117d
  Léo Bouscarrat authored 5 years ago
  
  When training, look if there is bayesian search results, if yes use this. Exception: forest_size use the one given by parser if applicable
  c66d117d
- add multiclass classifier mais attention ya un bug dans le calcul du score · 5e50bbaa
  Luc Giffon authored 5 years ago
  
  5e50bbaa
Nov 21, 2019

Big changes: Create intermediate classes OMPForest and SingleOmpForest for... · 3f5cdf68

Luc Giffon authored 5 years ago

Big changes: Create intermediate classes OMPForest and SingleOmpForest for code factoring: share code between OmpForestRegressor and OmpForestBinaryClassifer. Remove set_wweights and set_forest which are not relevant anymore. load function from model_factory isn't trustfull now: raises an error. TODO: multiclass classifier

3f5cdf68

Nov 20, 2019
- Add functions to do bayesian hyperparameters search · bf5803b6
  Léo Bouscarrat authored 5 years ago
  
  bf5803b6
Nov 09, 2019
- Fix a spelling mistake in train.py. · 211dc83a
  Charly LAMOTHE authored 5 years ago
  
  211dc83a
- - Add experiment_configuration parameter to run an experiment from a json... · 789a11a6
  Charly LAMOTHE authored 5 years ago
  
  - Add experiment_configuration parameter to run an experiment from a json configuration file. If the experiment configuration are commnig from the arguments, save it to a file to keep trace of it; - Add few comments in train.py.
  789a11a6
- - Compute each computations of a given seed in a dedicated job; · cb0030d8
  Charly LAMOTHE authored 5 years ago
  
  - Use as much CPU as possible when training a random forest regressor.
  cb0030d8
Nov 08, 2019
- Add the weights normalization parameter (but not implemented yet) · 0fce0319
  Charly LAMOTHE authored 5 years ago
  
  0fce0319
Nov 06, 2019

Replace use_dev_subset by subsets_used parameter, in order to specify more... · 7455fd98

Charly LAMOTHE authored 5 years ago

Replace use_dev_subset by subsets_used parameter, in order to specify more clearly which combination of train dev to used to train the forest and OMP

7455fd98

Nov 05, 2019
- Train the forest on train and OMP on dev OR train both the forest and OMP on train+dev · 28b804c6
  Charly LAMOTHE authored 5 years ago
  
  28b804c6
- Add an option to disable the progress bars · 9199d9bb
  Charly LAMOTHE authored 5 years ago
  
  9199d9bb
- Uppercased default consts · 306656fc
  Charly LAMOTHE authored 5 years ago
  
  306656fc