This function allows to find tasks on https://openml.org/t using some simple filter criteria.

Note that only a subset of filters is exposed here. For a more feature-complete package, see OpenML.

list_oml_tasks(
  task_id = NULL,
  number_instances = NULL,
  number_features = NULL,
  number_classes = NULL,
  number_missing_values = NULL,
  tag = NULL,
  limit = 5000L,
  ...
)

Arguments

task_id

(integer())
Vector of task ids to restrict to.

number_instances

(integer())
Filter for number of instances.

number_features

(integer())
Filter for number of features.

number_classes

(integer())
Filter for number of labels of the target (only classification tasks).

number_missing_values

(integer())
Filter for number of missing values.

tag

(character())
Filter for specific tag. You can provide multiple tags as character vector.

limit

(integer())
Limit the results to limit records. Default is 5000.

...

(any)
Additional filters as named arguments.

Value

(data.table()) of results, or NULL if no data set matches the criteria.

Details

Filter values can be provided as single atomic values (typically integer or character). Provide a numeric vector of length 2 (c(l, u)) to find matches in the range \([l, u]\).

References

Casalicchio G, Bossek J, Lang M, Kirchhoff D, Kerschke P, Hofner B, Seibold H, Vanschoren J, Bischl B (2017). “OpenML: An R Package to Connect to the Machine Learning Platform OpenML.” Computational Statistics, 1--15. doi: 10.1007/s00180-017-0742-2 .

Vanschoren J, van Rijn JN, Bischl B, Torgo L (2014). “OpenML.” ACM SIGKDD Explorations Newsletter, 15(2), 49--60. doi: 10.1145/2641190.2641198 .

Examples

# \donttest{ list_oml_tasks(number_instances = 150, number_features = c(1, 10))
#> task_id task_type data_id name #> 1: 59 Supervised Classification 61 iris #> 2: 118 Learning Curve 61 iris #> 3: 289 Supervised Classification 61 iris #> 4: 1758 Learning Curve 61 iris #> 5: 1823 Supervised Classification 61 iris #> --- #> 471: 359608 Clustering 42261 iris-example #> 472: 359783 Clustering 42535 TEST10e627dcde-UploadTestWithURL #> 473: 359882 Clustering 42661 iris #> 474: 359912 Clustering 42699 iris #> 475: 359913 Clustering 42700 iris #> status MajorityClassSize MaxNominalAttDistinctValues MinorityClassSize #> 1: active 50 3 50 #> 2: active 50 3 50 #> 3: active 50 3 50 #> 4: active 50 3 50 #> 5: active 50 3 50 #> --- #> 471: active 50 NA 50 #> 472: active NA NA NA #> 473: active NA NA NA #> 474: active NA NA NA #> 475: active 50 NA 50 #> NumberOfClasses NumberOfFeatures NumberOfInstances #> 1: 3 5 150 #> 2: 3 5 150 #> 3: 3 5 150 #> 4: 3 5 150 #> 5: 3 5 150 #> --- #> 471: 3 5 150 #> 472: NA 5 150 #> 473: NA 5 150 #> 474: NA 5 150 #> 475: 3 5 150 #> NumberOfInstancesWithMissingValues NumberOfMissingValues #> 1: 0 0 #> 2: 0 0 #> 3: 0 0 #> 4: 0 0 #> 5: 0 0 #> --- #> 471: 0 0 #> 472: 0 0 #> 473: 0 0 #> 474: 0 0 #> 475: 0 0 #> NumberOfNumericFeatures NumberOfSymbolicFeatures #> 1: 4 1 #> 2: 4 1 #> 3: 4 1 #> 4: 4 1 #> 5: 4 1 #> --- #> 471: 4 1 #> 472: 4 1 #> 473: 4 0 #> 474: 4 1 #> 475: 4 1
# }