Classification Of Geographic Origin Of Rice By Data Mining And Inductively Coupled Plasma Mass Spectrometry

Keywords

Classification; Data mining; F-score; ICP-MS; Rice; Support vector machines

Abstract

Rice is one of the most consumed cereals in the world and the main food product in the diet of the Brazilian population. Brazil itself is among the ten largest producers of rice, and most of the harvest comes from the South and Midwest regions. This paper presents a data mining study of samples of rice obtained from producers in Goiás (Midwest region) and Rio Grande do Sul (South region), and builds classification models capable of predicting the geographical origin of a rice sample based on its chemical components. We use three popular classification techniques, support vector machines, random forests and neural networks, along with the F-score formula which measures the relative importance of the input variables. We achieved very good performances for the SVM, RF and MLP models with 93.66%, 93.83% and 90% prediction accuracy, respectively, on the 10-fold cross validation. The F-score shows that Cd(cadmium), Rb(rubidium), Mg(magnesium) and K(potassium) are the four most relevant components for prediction.

Publication Date

2-1-2016

Publication Title

Computers and Electronics in Agriculture

Volume

121

Number of Pages

101-107

Document Type

Article

Personal Identifier

scopus

DOI Link

https://doi.org/10.1016/j.compag.2015.11.009

Socpus ID

84951863017 (Scopus)

Source API URL

https://api.elsevier.com/content/abstract/scopus_id/84951863017

This document is currently not available here.

Share

COinS