Changelog
Source: NEWS.md
mlr3oml (development version)
New Features
- Added list_oml_collections() to list collections (run or task collections) on OpenML.
mlr3oml 0.10.0
CRAN release: 2024-06-04
- Bugfix: Caching of parquet files failed when the temporary directory was on a different partition than the cache directory.
- feat: Add option mlr3oml.retries to control the number of retries when downloading data from OpenML. The default is 3.
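
A minimal sketch of how the retry option might be set, assuming it is read through base R's options() mechanism like the package's other global options; the value 5 is an arbitrary illustration:

```r
# Raise the number of download retries from the default of 3 to 5.
# The value 5 is only an example.
options(mlr3oml.retries = 5)

# Inspect the current setting (NULL means the option is unset and the
# package default applies).
getOption("mlr3oml.retries")
```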
mlr3oml 0.9.0
CRAN release: 2023-09-13
- Fix: Parquet datasets now work where columns simultaneously have to be renamed and converted.
- Added upload functions:
  - publish_data to upload a dataset to OpenML
  - publish_task to create a task on OpenML
  - publish_collection to create a collection on OpenML
mlr3oml 0.8.0
CRAN release: 2023-07-06
- Listing functions don’t return the tables invisibly anymore.
- Address CRAN NOTE regarding unused bit64 import.
- Improved the printer for all OpenML objects.
- Removed benchmark_grid_oml(), which was already deprecated in release 0.7.2.
- Removed the fields runs, flows, data, tasks from the OMLCollection class. Consequently, the cache option can no longer be set for OMLCollection objects; see the class documentation for more information.
- Removed the examples, as they caused problems with CRAN checks when OpenML was unavailable.
- Caching can no longer be specified at the instance level but only globally through the option mlr3oml.cache.
- Added $download() method for all OML objects to fully download an object for offline usage.
- Incremented the cache version for parquet data due to a change in OpenML.
- Added an online tutorial for the package.
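
A minimal sketch of enabling the global cache, assuming the option accepts TRUE (to use a default cache directory) or a path on the local file system; the path shown is hypothetical:

```r
# Enable caching for all OpenML objects in this session.
# Assumption: TRUE selects the package's default cache directory.
options(mlr3oml.cache = TRUE)

# Assumption: a directory path may be given instead of TRUE.
# The path below is a hypothetical example.
# options(mlr3oml.cache = "~/.cache/mlr3oml")
```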
mlr3oml 0.7.2
CRAN release: 2023-06-12
- Fix: target is added to features when converting an OMLData object to a task with an explicit target variable that is not the default target.
- Deprecated benchmark_grid_oml() in favour of mlr3::benchmark_grid(..., paired = TRUE).
- Fix: Incremented cache version for data objects for int64 data types (introduced in the previous release).
- Fix: Incremented cache version for data description and fixed bug, as make.names() was not applied to ignore attributes.
- Fix bug in task converter (features were sometimes not set correctly)
- Collection now shows name in printer
- Better error message when parquet dataset creation fails
mlr3oml 0.7.1
CRAN release: 2023-03-14
- Fixed argument names of S3 method for as_data_backend to comply with new CRAN checks
mlr3oml 0.7.0
CRAN release: 2022-12-12
- feature: Add argument task_type to function list_oml_tasks().
- fix: strings and nominals are distinguished for parquet files
- docs: Fixed some OpenML links
- docs: Renamed the docs for OpenML objects
- Renamed the sugar functions:
  - oml_data() is now odt()
  - oml_task() is now otsk()
  - oml_flow() is now oflw()
  - oml_run() is now orn()
  - oml_collection() is now ocl()
- Addresses a CRAN issue: examples fail gracefully if OpenML server is busy.
mlr3oml 0.6.0
CRAN release: 2022-10-12
Features
- Add R6 classes for OMLCollection, OMLRun, OMLFlow.
- Added function benchmark_grid_oml that allows for easier creation of benchmark designs from OpenML task-resampling pairs.
- Added sugar functions oml_flow, oml_data, oml_task, oml_run, oml_collection for all OpenML objects.
- Conversion from OpenML to mlr3 objects is now only possible with the usual S3 converters as_<object>. This improves consistency by ensuring that the subcomponents of OpenML objects are always OpenML objects and not suddenly mlr3 objects.
- Added more converter functions: as_learner, as_resample_result, as_data_backend, as_benchmark_result.
- Added support for parquet files that were recently introduced on OpenML. The global option mlr3oml.parquet can be used to enable or disable this. By default it is FALSE. This is implemented via the duckdb backend from mlr3db.
- Support to use the OpenML test server. This can be globally enabled using the option mlr3oml.test_server or individually for objects. An API key for the test server can be defined globally through the environment variable TESTOPENMLAPIKEY or the option mlr3oml.test_api_key.
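
The options named in this list might be set as sketched below, assuming they are ordinary global options read via base R's options(); the key string is a placeholder, not a real key:

```r
# Opt in to parquet files (the default in this release is FALSE).
options(mlr3oml.parquet = TRUE)

# Use the OpenML test server globally.
options(mlr3oml.test_server = TRUE)

# Placeholder value: replace with your own test-server API key.
options(mlr3oml.test_api_key = "<your-test-server-api-key>")

# Alternative to the option above: set the environment variable instead.
# Sys.setenv(TESTOPENMLAPIKEY = "<your-test-server-api-key>")
```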
Fixes
- Removed support for survival tasks as mlr3proba is no longer on CRAN
- OpenML tasks can now also be filtered according to the task type
Other
- Implement an arff writer and remove the arff dependency, therefore also removing the option "farff" as the mlr3oml.arff_parser.
- Increment the cache version number due to changes in the cache structure: This will flush the previous cache folder.
- Simplified the code structure by adding OMLObject class from which all other OpenML objects like OMLData, OMLTask inherit.
mlr3oml 0.5.0
CRAN release: 2021-09-24
- Support for downloading survival tasks (via mlr3proba).
- More functions to list objects from OpenML:
mlr3oml 0.4.2
CRAN release: 2021-02-22
- If not set via option mlr3oml.api_key, the API key is retrieved from the environment variable OPENMLAPIKEY.
- Implemented a retry mechanism as a workaround for temporary connection errors.
mlr3oml 0.4.0
CRAN release: 2020-10-20
- The parsers for ARFF files can now be explicitly selected via option "mlr3oml.arff.parser". Default is the internal parser based on data.table::fread().
- Improved stability of the internal ARFF parser in case of malformed ARFF files and non-standardized quotes.
mlr3oml 0.3.0
CRAN release: 2020-10-05
- The connectors used in mlr_tasks and mlr_resamplings now signal errors of class missingDefaultError if some defaults are not set.
- Target columns are now automatically converted to the required storage mode during task creation.
- Removed dependency on orphaned package bibtex.