Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Is the val_set kwarg within train_ensemble used to affect the fitting process in any way? #28

Closed
sgbaird opened this issue Mar 9, 2022 · 2 comments

Comments

@sgbaird
Copy link

sgbaird commented Mar 9, 2022

(e.g. learning rate scheduler, internal optimizer, hyperparameter optimization) See also anthony-wang/CrabNet#15

Moved into its own issue from #25 (comment)

@sgbaird sgbaird changed the title Is the val_set kwarg within train_ensemble used to affect the predictions in any way? Is the val_set kwarg within train_ensemble used to affect the fitting process in any way? Mar 9, 2022
@CompRhys
Copy link
Owner

CompRhys commented Mar 9, 2022

Tldr; not in the examples scripts but it was used for proper held out tuning when developing the model.

On this I initially was using a validation set to tune roost but after I moved from small datasets where we were performing equivalently to Magpie+RF to OQMD where we saw a huge improvement over ElemNet I fixed the architecture and optimization settings to focus on other aspects such as uncertainty and ensembles and stopped holding out the extra val data.

It would be good to be more explicit about hyperparameter choices.

We used to keep the best model checkpoint according to the val loss but stopped that due to the risk of people using that functionality to “cheat” if passing the test set as the val set.

@CompRhys
Copy link
Owner

happy to close?

@sgbaird sgbaird closed this as completed Mar 11, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants