Machine learning for Big Data analytics in plants

Chuang Ma, Hao Zhang, Xiangfeng Wang

Research output: Contribution to journalArticle

63 Citations (Scopus)

Abstract

Rapid advances in high-throughput genomic technology have enabled biology to enter the era of 'Big Data' (large datasets). The plant science community not only needs to build its own Big-Data-compatible parallel computing and data management infrastructures, but also to seek novel analytical paradigms to extract information from the overwhelming amounts of data. Machine learning offers promising computational and analytical solutions for the integrative analysis of large, heterogeneous and unstructured datasets on the Big-Data scale, and is gradually gaining popularity in biology. This review introduces the basic concepts and procedures of machine-learning applications and envisages how machine learning could interface with Big Data technology to facilitate basic research and biotechnology in the plant sciences.

Original languageEnglish (US)
Pages (from-to)798-808
Number of pages11
JournalTrends in Plant Science
Volume19
Issue number12
DOIs
StatePublished - Dec 1 2014

Fingerprint

artificial intelligence
Biological Sciences
infrastructure
biotechnology
genomics

Keywords

  • Big Data
  • Large-scale datasets
  • Machine learning
  • Plants

ASJC Scopus subject areas

  • Plant Science

Cite this

Machine learning for Big Data analytics in plants. / Ma, Chuang; Zhang, Hao; Wang, Xiangfeng.

In: Trends in Plant Science, Vol. 19, No. 12, 01.12.2014, p. 798-808.

Research output: Contribution to journalArticle

Ma, Chuang ; Zhang, Hao ; Wang, Xiangfeng. / Machine learning for Big Data analytics in plants. In: Trends in Plant Science. 2014 ; Vol. 19, No. 12. pp. 798-808.
@article{725299bde9ef4be58909fef43d96f0a9,
title = "Machine learning for Big Data analytics in plants",
abstract = "Rapid advances in high-throughput genomic technology have enabled biology to enter the era of 'Big Data' (large datasets). The plant science community not only needs to build its own Big-Data-compatible parallel computing and data management infrastructures, but also to seek novel analytical paradigms to extract information from the overwhelming amounts of data. Machine learning offers promising computational and analytical solutions for the integrative analysis of large, heterogeneous and unstructured datasets on the Big-Data scale, and is gradually gaining popularity in biology. This review introduces the basic concepts and procedures of machine-learning applications and envisages how machine learning could interface with Big Data technology to facilitate basic research and biotechnology in the plant sciences.",
keywords = "Big Data, Large-scale datasets, Machine learning, Plants",
author = "Chuang Ma and Hao Zhang and Xiangfeng Wang",
year = "2014",
month = "12",
day = "1",
doi = "10.1016/j.tplants.2014.08.004",
language = "English (US)",
volume = "19",
pages = "798--808",
journal = "Trends in Plant Science",
issn = "1360-1385",
publisher = "Elsevier Limited",
number = "12",

}

TY - JOUR

T1 - Machine learning for Big Data analytics in plants

AU - Ma, Chuang

AU - Zhang, Hao

AU - Wang, Xiangfeng

PY - 2014/12/1

Y1 - 2014/12/1

N2 - Rapid advances in high-throughput genomic technology have enabled biology to enter the era of 'Big Data' (large datasets). The plant science community not only needs to build its own Big-Data-compatible parallel computing and data management infrastructures, but also to seek novel analytical paradigms to extract information from the overwhelming amounts of data. Machine learning offers promising computational and analytical solutions for the integrative analysis of large, heterogeneous and unstructured datasets on the Big-Data scale, and is gradually gaining popularity in biology. This review introduces the basic concepts and procedures of machine-learning applications and envisages how machine learning could interface with Big Data technology to facilitate basic research and biotechnology in the plant sciences.

AB - Rapid advances in high-throughput genomic technology have enabled biology to enter the era of 'Big Data' (large datasets). The plant science community not only needs to build its own Big-Data-compatible parallel computing and data management infrastructures, but also to seek novel analytical paradigms to extract information from the overwhelming amounts of data. Machine learning offers promising computational and analytical solutions for the integrative analysis of large, heterogeneous and unstructured datasets on the Big-Data scale, and is gradually gaining popularity in biology. This review introduces the basic concepts and procedures of machine-learning applications and envisages how machine learning could interface with Big Data technology to facilitate basic research and biotechnology in the plant sciences.

KW - Big Data

KW - Large-scale datasets

KW - Machine learning

KW - Plants

UR - http://www.scopus.com/inward/record.url?scp=84915785146&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84915785146&partnerID=8YFLogxK

U2 - 10.1016/j.tplants.2014.08.004

DO - 10.1016/j.tplants.2014.08.004

M3 - Article

AN - SCOPUS:84915785146

VL - 19

SP - 798

EP - 808

JO - Trends in Plant Science

JF - Trends in Plant Science

SN - 1360-1385

IS - 12

ER -