当前位置:文档之家› uci数据集大致情况翻译

uci数据集大致情况翻译

uci数据集大致情况翻译来源:/ml/datasets.html?format=&task=&att=&area=&numAtt=&n umIns=&type=&sort=nameUp&view=listTable View List View 206 Data Sets1. Abalone: Predict the age of abalone from physical measurements 鲍鱼DataSet:根据物理度量,预测鲍鱼的年龄。

2. Abscisic Acid Signaling Network: The objective is to determine the set ofboolean rules that describe the interactions of the nodes within this plantsignaling network. The dataset includes 300 separate boolean pseudodynamic simulations using an asynchronous update scheme.目标是测定布尔值的度量集合,以描述植物的信号网路节点。

该数据集包括了300个独立的布尔值形式的虚拟动态模拟值,使用了异步更新的架构。

3. Acute Inflammations: The data was created by a medical expert as a data set to test the expert system, which will perform the presumptive diagnosis of two diseases of the urinary system.急性炎症DataSet:数据来源于一位医学专家的数据集,用以检测专家系统,可以推断出泌尿系统的两种疾病的诊断结果。

4. Adult: Predict whether income exceeds $50K/yr based on census data. Also known as \成人DataSet:根据户口普查资料,预测收入是否能超过50000美元/年。

通常也被称为“收入普查”数据集。

5. Annealing: Steel annealing data 退火DataSet:训练退火数据。

6. Anonymous Microsoft Web Data: Log of anonymous users of; predict areas of the web site a user visited based on data on other areas the user visited.匿名微软网络数据:微软网站的匿名用户记录;通过其他的用户访问区域数据,预测用户在web站点的访问区域。

7. Arcene: ARCENE's task is to distinguish cancer versus normal patterns from mass-spectrometric data. This is a two-class classification problem withcontinuous input variables. This dataset is one of 5 datasets of the NIPS 2021 feature selection challenge.ArceneDataSet:该数据集的任务是根据大量的观测数据,从正常的模式中辨别出癌症。

这是一个根据不断输入的变量的二级分类问题。

该数据集是从NIPS2021特征选择挑战比赛中的5个数据集之一。

8. Arrhythmia: Distinguish between the presence and absence of cardiac arrhythmia and classify it in one of the 16 groups.心率失常DataSet:分辨是否出现心率失常,并将结果分类进16个组之一。

9. Artificial Characters: Dataset artificially generated by using first order theory which describes structure of ten capital letters of English alphabet 人为性状DataSet:通过使用第一次序理论(该理论可以描述出英语字母表的十个开头字母的结构),自动生成的数据集。

10. Audiology (Original): Nominal audiology dataset from Baylor 原始AudiologyDataSet:来自Baylor的标称型的audiology数据集。

11. Audiology (Standardized): Standardized version of the original audiology database标准AudiologyDataSet:原始Audiology数据集的标准化版本。

12. Australian Sign Language signs: This data consists of sample of Auslan (Australian Sign Language) signs. Examples of 95 signs were collected fromfive signers with a total of 6650 sign samples.澳大利亚标记语言标记DataSet:这些数据包括了澳大利亚标记语言标记的样本。

95个实例,均来自五个标识器,其中有6650个标记样本。

13. Australian Sign Language signs (High Quality): This data consists of sample of Auslan (Australian Sign Language) signs. 27 examples of each of 95 Auslan signs were captured from a native signer using high-quality position trackers澳大利亚标记语言标记DataSet高品质版:该数据集包含了Auslan标记的样本。

有27个实例,它们来自95个标记,这27个实例是使用高质量位置追踪器的当地标识器捕捉出来的。

14. Auto MPG: Revised from CMU StatLib library, data concerns city-cyclefuel consumption自动MPGDataSet:来自CMU StatLib实验室的精品,是与城市循环能源消耗相关的数据集。

15. Automobile: From 1985 Ward's Automotive Yearbook 汽车DataSet:来自1985的沃德自动化年鉴。

16. AutoUniv: AutoUniv is an advanced data generator for classifications tasks. The aim is to reflect the nuances and heterogeneity of real data. Data can be generated in .csv, ARFF or C4.5 formats.AutoUniv是一个高级数据生成器,可以用来处理分类任务。

目标是反映现实数据的微妙与不同之处。

数据可以在.csv中生成,采用ARFF或者C4.5的格式。

17. Bach Chorales: Time-series data based on chorales; challenge is to learn generative grammar; data in Lisp基于Chorales的时间序列数据集;可以用来挑战生成性的语法;数据放在Lisp中。

18. Badges: Badges labeled with a \徽章DataSet:标记了“+”或“-”的符号的标记,可以作为一个人姓名的函数表达式。

19. Bag of Words: This data set contains five text collections in the form of bags-of-words.词语包DataSet:该数据集包含了5个文本集合,每个文本集合以词语包的形式展现。

20. Balance Scale: Balance scale weight & distance database 天平DataSet:天平的重量和距离数据库。

21. Balloons: Data previously used in cognitive psychology experiment; 4 data sets represent different conditions of an experiment气球DataSet:曾经用在认知心理学实验中的数据;4个数据集代表了一个实验中的不同条件。

22. Blood Transfusion Service Center: Data taken from the BloodTransfusion Service Center in Hsin-Chu City in Taiwan -- this is a classificationproblem.输血服务中心DataSet:来自台湾的Hsin-CHu市的输血服务中心的数据――用以解决分类问题。

23. Breast Cancer: Breast Cancer Data (Restricted Access) 乳腺癌DataSet:乳腺癌数据(访问限制)。

24. Breast Cancer Wisconsin (Diagnostic): Diagnostic Wisconsin Breast Cancer Database乳腺癌威斯康星洲(诊断数据)DataSet:威斯康星的乳腺癌诊断数据。

25. Breast Cancer Wisconsin (Original): Original Wisconsin Breast Cancer Database乳腺癌威斯康星洲(原始数据):原始的威斯康星州乳腺癌数据库。

相关主题