Package: mldr.datasets 0.4.2

David Charte

mldr.datasets: R Ultimate Multilabel Dataset Repository

Large collection of multilabel datasets along with the functions needed to export them to several formats, to make partitions, and to obtain bibliographic information.

Authors: David Charte [cre], Francisco Charte [aut], Antonio J. Rivera [aut]

mldr.datasets_0.4.2.tar.gz
mldr.datasets_0.4.2.zip (r-4.5) | mldr.datasets_0.4.2.zip (r-4.4) | mldr.datasets_0.4.2.zip (r-4.3)
mldr.datasets_0.4.2.tgz (r-4.4-any) | mldr.datasets_0.4.2.tgz (r-4.3-any)
mldr.datasets_0.4.2.tar.gz (r-4.5-noble) | mldr.datasets_0.4.2.tar.gz (r-4.4-noble)
mldr.datasets_0.4.2.tgz (r-4.4-emscripten) | mldr.datasets_0.4.2.tgz (r-4.3-emscripten)
mldr.datasets.pdf | mldr.datasets.html
mldr.datasets/json (API)

# Install 'mldr.datasets' in R:
install.packages('mldr.datasets', repos = c('https://fcharte.r-universe.dev', 'https://cloud.r-project.org'))

Bug tracker: https://github.com/fcharte/mldr.datasets/issues

Datasets:
  • birds - Dataset with sounds produced by birds and the species they belong to
  • cal500 - Dataset with music data along with labels for emotions, instruments, genres, etc.
  • emotions - Dataset with features extracted from music tracks and the emotions they produce
  • flags - Dataset with features corresponding to world flags
  • genbase - Dataset with genes data and their functional expression
  • langlog - Dataset with data from the Language forum discussion
  • medical - Dataset generated from medical reports
  • ng20 - Dataset with news messages and the news groups they belong to
  • slashdot - Dataset generated from slashdot.org site entries
  • stackex_chess - Dataset from the Stack Exchange's chess forum
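The datasets listed above ship with the package, so they should be usable as soon as the package is attached. A minimal sketch, assuming the bundled objects follow the usual `mldr` layout (a `dataset` data frame plus a `measures` list); the variable names `ds`, `n_instances` and `n_labels` are illustrative only:

```r
library(mldr.datasets)

# 'emotions' is one of the bundled datasets listed above
ds <- emotions

# Basic dimensions, read from the object's precomputed measures
# (field names assume the standard mldr object layout)
n_instances <- ds$measures$num.instances
n_labels    <- ds$measures$num.labels
cat("instances:", n_instances, "labels:", n_labels, "\n")

# density() and sparsity() are exported by the package; the explicit
# namespace avoids confusion with stats::density()
d <- mldr.datasets::density(ds)
s <- mldr.datasets::sparsity(ds)
print(d)
print(s)
```

Datasets that are not bundled are documented as retrievable by name through `get.mldr`, with `available.mldrs` listing what can be downloaded; both need network access.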

On CRAN:

4.68 score | 8 stars | 120 scripts | 562 downloads | 1 mention | 67 exports | 0 dependencies

Last updated 6 years ago from: e65c9f8935. Checks: OK: 7. Indexed: yes.

Target | Result | Date
Doc / Vignettes | OK | Oct 15 2024
R-4.5-win | OK | Oct 15 2024
R-4.5-linux | OK | Oct 15 2024
R-4.4-win | OK | Oct 15 2024
R-4.4-mac | OK | Oct 15 2024
R-4.3-win | OK | Oct 15 2024
R-4.3-mac | OK | Oct 15 2024

Exports: available.mldrs, bibtex, bookmarks, check_n_load.mldr, corel16k001, corel16k002, corel16k003, corel16k004, corel16k005, corel16k006, corel16k007, corel16k008, corel16k009, corel16k010, corel5k, delicious, density, enron, eurlexdc_test, eurlexdc_tra, eurlexev_test, eurlexev_tra, eurlexsm_test, eurlexsm_tra, get.mldr, imdb, iterative.stratification.holdout, iterative.stratification.kfolds, iterative.stratification.partitions, mediamill, mldrs, nuswide_BoW, nuswide_VLAD, ohsumed, random.holdout, random.kfolds, random.partitions, rcv1sub1, rcv1sub2, rcv1sub3, rcv1sub4, rcv1sub5, reutersk500, sparsity, stackex_chemistry, stackex_coffee, stackex_cooking, stackex_cs, stackex_philosophy, stratified.holdout, stratified.kfolds, stratified.partitions, tmc2007, tmc2007_500, write.mldr, yahoo_arts, yahoo_business, yahoo_computers, yahoo_education, yahoo_entertainment, yahoo_health, yahoo_recreation, yahoo_reference, yahoo_science, yahoo_social, yahoo_society, yeast

Dependencies: none

Readme and manuals

Help Manual

Help page | Topics
Obtain additional datasets available to download | available.mldrs
Dataset with BibTeX entries | bibtex
Dataset with sounds produced by birds and the species they belong to | birds
Dataset with data from web bookmarks and their categories | bookmarks
Dataset with music data along with labels for emotions, instruments, genres, etc. | cal500
(Defunct) Check if an mldr object is locally available and download it if needed | check_n_load.mldr
Datasets with data from the Corel image collection. There are 10 subsets in corel16k | corel16k001
Datasets with data from the Corel image collection. There are 10 subsets in corel16k | corel16k002
Datasets with data from the Corel image collection. There are 10 subsets in corel16k | corel16k003
Datasets with data from the Corel image collection. There are 10 subsets in corel16k | corel16k004
Datasets with data from the Corel image collection. There are 10 subsets in corel16k | corel16k005
Datasets with data from the Corel image collection. There are 10 subsets in corel16k | corel16k006
Datasets with data from the Corel image collection. There are 10 subsets in corel16k | corel16k007
Datasets with data from the Corel image collection. There are 10 subsets in corel16k | corel16k008
Datasets with data from the Corel image collection. There are 10 subsets in corel16k | corel16k009
Datasets with data from the Corel image collection. There are 10 subsets in corel16k | corel16k010
Dataset with data from the Corel image collection | corel5k
Dataset generated from the del.icio.us site bookmarks | delicious
Calculate the density level of the dataset | density
Dataset with features extracted from music tracks and the emotions they produce | emotions
Dataset with email messages and the folders where the users stored them | enron
List with 10 folds of the test data from the EUR-Lex directory codes dataset | eurlexdc_test
List with 10 folds of the train data from the EUR-Lex directory codes dataset | eurlexdc_tra
List with 10 folds of the test data from the EUR-Lex EUROVOC descriptors dataset | eurlexev_test
List with 10 folds of the train data from the EUR-Lex EUROVOC descriptors dataset | eurlexev_tra
List with 10 folds of the test data from the EUR-Lex subject matters dataset | eurlexsm_test
List with 10 folds of the train data from the EUR-Lex subject matters dataset | eurlexsm_tra
Dataset with features corresponding to world flags | flags
Dataset with genes data and their functional expression | genbase
Get a multilabel dataset by name | get.mldr
Dataset generated from the IMDB film database | imdb
Hold-out partitioning of an mldr object | iterative.stratification.holdout
Partition an mldr object into k folds | iterative.stratification.kfolds
Generic partitioning of an mldr object | iterative.stratification.partitions
Dataset with data from the Language forum discussion | langlog
Dataset with features extracted from video sequences and semantic concepts assigned as labels | mediamill
Dataset generated from medical reports | medical
(Defunct) Obtain and show a list of additional datasets available to download | mldrs
Dataset with news messages and the news groups they belong to | ng20
Dataset obtained from the NUS-WIDE database with BoW representation | nuswide_BoW
Dataset obtained from the NUS-WIDE database with cVLAD+ representation | nuswide_VLAD
Dataset generated from a subset of the Medline database | ohsumed
Hold-out partitioning of an mldr object | random.holdout
Partition an mldr object into k folds | random.kfolds
Generic partitioning of an mldr object | random.partitions
Dataset from the Reuters corpus (subset 1) | rcv1sub1
Dataset from the Reuters corpus (subset 2) | rcv1sub2
Dataset from the Reuters corpus (subset 3) | rcv1sub3
Dataset from the Reuters corpus (subset 4) | rcv1sub4
Dataset from the Reuters corpus (subset 5) | rcv1sub5
Dataset from the Reuters Corpus with the 500 most relevant features selected | reutersk500
Dataset from images with different natural scenes | scene
Dataset generated from slashdot.org site entries | slashdot
Calculate the sparsity level of the dataset | sparsity
Dataset from the Stack Exchange's chemistry forum | stackex_chemistry
Dataset from the Stack Exchange's chess forum | stackex_chess
Dataset from the Stack Exchange's coffee forum | stackex_coffee
Dataset from the Stack Exchange's cooking forum | stackex_cooking
Dataset from the Stack Exchange's computer science forum | stackex_cs
Dataset from the Stack Exchange's philosophy forum | stackex_philosophy
Hold-out partitioning of an mldr object | stratified.holdout
Partition an mldr object into k folds | stratified.kfolds
Generic partitioning of an mldr object | stratified.partitions
Dataset from airplane failure reports | tmc2007
Dataset from airplane failure reports (500 most relevant features extracted) | tmc2007_500
BibTeX entry associated with an mldr object | toBibtex.mldr
Export an mldr object or set of mldr objects to different file formats | write.mldr
Dataset generated from the Yahoo! web site index (arts category) | yahoo_arts
Dataset generated from the Yahoo! web site index (business category) | yahoo_business
Dataset generated from the Yahoo! web site index (computers category) | yahoo_computers
Dataset generated from the Yahoo! web site index (education category) | yahoo_education
Dataset generated from the Yahoo! web site index (entertainment category) | yahoo_entertainment
Dataset generated from the Yahoo! web site index (health category) | yahoo_health
Dataset generated from the Yahoo! web site index (recreation category) | yahoo_recreation
Dataset generated from the Yahoo! web site index (reference category) | yahoo_reference
Dataset generated from the Yahoo! web site index (science category) | yahoo_science
Dataset generated from the Yahoo! web site index (social category) | yahoo_social
Dataset generated from the Yahoo! web site index (society category) | yahoo_society
Dataset with protein profiles and their categories | yeast
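
The partitioning and export topics listed above can be combined roughly as follows. This is a sketch rather than a reference: it assumes that `random.holdout` takes a training percentage `p` and returns a list with `train` and `test` elements, that the `*.kfolds` functions return one element per fold, and that `write.mldr` accepts `format` and `basename` arguments, as described in the package manual. The seed value, the output path and the variable names are illustrative choices.

```r
library(mldr.datasets)

# Hold-out split: p is assumed to be the percentage of instances used for training
parts <- random.holdout(emotions, p = 70, seed = 42)
train_n <- nrow(parts$train$dataset)
test_n  <- nrow(parts$test$dataset)
cat("train:", train_n, "test:", test_n, "\n")

# Stratified 5-fold partitioning; the result is assumed to hold one entry per fold
folds <- stratified.kfolds(emotions, k = 5, seed = 42)
cat("folds:", length(folds), "\n")

# Export the full dataset as CSV into a temporary directory
# (the path in 'out_base' is a hypothetical choice for this example)
out_base <- file.path(tempdir(), "emotions_export")
write.mldr(emotions, format = "CSV", basename = out_base)
list.files(tempdir(), pattern = "^emotions_export")
```

The `iterative.stratification.*` functions are listed with the same help page titles as their `random.*` and `stratified.*` counterparts, so they can presumably be swapped in without other changes.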