Changelog
Source: NEWS.md
mlr3oml 0.9.0-9000
- Bugfix: Caching of parquet files failed when the temporary directory was on a different partition than the cache directory.
- feat: Add option mlr3oml.retries to control the number of retries when downloading data from OpenML. The default is 3.
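
As a minimal sketch of how the new option might be used: the option name and its default of 3 come from the entry above, while the value 5 and the point at which the option is set are illustrative assumptions.

```r
# Assumption: the option is set like any other R option, before data is requested.
options(mlr3oml.retries = 5)  # the changelog states the default is 3

# Inspect the value currently set in this session.
getOption("mlr3oml.retries")
```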
mlr3oml 0.9.0
CRAN release: 2023-09-13
- Fix: Parquet datasets now work where columns simultaneously have to be renamed and converted.
- Added upload functions:
  - publish_data to upload a dataset on OpenML
  - publish_task to create a task on OpenML
  - publish_collection to create a collection on OpenML
mlr3oml 0.8.0
CRAN release: 2023-07-06
- Listing functions don’t return the tables invisibly anymore.
- Address CRAN NOTE regarding unused bit64 import.
- Improved the printer for all OpenML objects.
- Removed benchmark_grid_oml(), which was already deprecated in release 0.7.2.
- Removed the fields runs, flows, data, tasks from the OMLCollection class. Consequently, the cache option can no longer be set for OMLCollection objects, see the class documentation for more information.
- Removed the examples, as they caused problems with CRAN checks when OpenML was unavailable.
- Caching can no longer be specified at the instance level but only globally through the option mlr3oml.cache.
- Added $download() method for all OML objects to fully download an object for offline usage.
- Incremented the cache version for parquet data due to a change in OpenML.
- Added an online tutorial for the package.
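
A short sketch of the global caching setup described in this release. The option name mlr3oml.cache is taken from the entry above; that TRUE selects a default cache location is an assumption here, so check the package documentation for the accepted values.

```r
# Enable caching for every OpenML object created in this session.
# Assumption: TRUE selects the package's default cache directory.
options(mlr3oml.cache = TRUE)

# Inspect the setting currently in effect.
getOption("mlr3oml.cache")
```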
mlr3oml 0.7.2
CRAN release: 2023-06-12
- Fix: target is added to features when converting an OMLData object to a task with an explicit target variable that is not the default target.
- Deprecated benchmark_grid_oml() in favour of mlr3::benchmark_grid(..., paired = TRUE)
- Fix: Incremented cache version for data objects for int64 data types (introduced in the previous release).
- Fix: Incremented cache version for data description and fixed bug, as make.names() was not applied to ignore attributes.
- Fix bug in task converter (features were sometimes not set correctly)
- Collection now shows name in printer
- Better error message when parquet dataset creation fails
mlr3oml 0.7.1
CRAN release: 2023-03-14
- Fixed argument names of S3 method for as_data_backend to comply with new CRAN checks
mlr3oml 0.7.0
CRAN release: 2022-12-12
- feature: Add argument task_type to function list_oml_tasks().
- fix: strings and nominals are distinguished for parquet files
- docs: Fixed some OpenML links
- docs: Renamed the docs for OpenML objects
- Renamed the sugar functions:
  - oml_data() is now odt()
  - oml_task() is now otsk()
  - oml_flow() is now oflw()
  - oml_run() is now orn()
  - oml_collection() is now ocl()
- Addresses a CRAN issue: examples fail gracefully if OpenML server is busy.
mlr3oml 0.6.0
CRAN release: 2022-10-12
Features
- Add R6 classes for OMLCollection, OMLRun, OMLFlow.
- Added function benchmark_grid_oml that allows for easier creation of benchmark designs from OpenML task-resampling pairs.
- Added sugar functions oml_flow, oml_data, oml_task, oml_run, oml_collection for all OpenML objects.
- Conversion from OpenML to mlr3 objects is now only possible with the usual S3 converters as_<object>. This improves consistency by ensuring that the subcomponents of OpenML objects are always OpenML objects and not suddenly mlr3 objects.
- Added more converter functions: as_learner, as_resample_result, as_data_backend, as_benchmark_result.
- Added support for parquet files that were recently introduced on OpenML. The global option mlr3oml.parquet can be used to enable or disable this. By default it is FALSE. This is implemented via the duckdb backend from mlr3db.
- Support to use the OpenML test server. This can be globally enabled using the option mlr3oml.test_server or individually for objects. An API key for the test server can be defined globally through the environment variable TESTOPENMLAPIKEY or the option mlr3oml.test_api_key.
Fixes
- Removed support for survival tasks as mlr3proba is no longer on CRAN
- OpenML tasks can now also be filtered according to the task type
Other
- Implement an arff writer and remove the arff dependency, therefore also removing the option "farff" as the mlr3oml.arff_parser
- Increment the cache version number due to changes in the cache structure: This will flush the previous cache folder.
- Simplified the code structure by adding OMLObject class from which all other OpenML objects like OMLData, OMLTask inherit.
mlr3oml 0.5.0
CRAN release: 2021-09-24
- Support for downloading survival tasks (via mlr3proba).
- More functions to list objects from OpenML:
mlr3oml 0.4.2
CRAN release: 2021-02-22
- If not set via option mlr3oml.api_key, the API key is retrieved from the environment variable OPENMLAPIKEY.
- Implemented a retry mechanism as a workaround for temporary connection errors.
mlr3oml 0.4.0
CRAN release: 2020-10-20
- The parsers for ARFF files can now be explicitly selected via option "mlr3oml.arff.parser". Default is the internal parser based on data.table::fread().
- Improved stability of the internal ARFF parser in case of malformed ARFF files and non-standardized quotes.
mlr3oml 0.3.0
CRAN release: 2020-10-05
- The connectors used in mlr_tasks and mlr_resamplings now signal errors of class missingDefaultError if some defaults are not set.
- Target columns are now automatically converted to the required storage mode during task creation.
- Removed dependency on orphaned package bibtex.