Please use this identifier to cite or link to this item:
https://hdl.handle.net/10321/4120
Title: | Qualitative classification of sugar processing stream products by near infrared spectroscopy |
Authors: | Nadar, Rowena |
Keywords: | Classification; Sugar processing stream products; Near Infrared Spectroscopy |
Issue Date: | 2021 |
Abstract: | The Sugar Milling Research Institute NPC (SMRI) is an integral part of the sugar industry: it provides a quality control service, among other consultation services, to sugar mills in South Africa and other parts of Africa. SMRI uses various prediction equations with near infrared spectroscopy (NIRS), in transmission mode, to predict analyte concentrations in the various sugar stream products. In this study, chemometrics was used to develop a classification model using discriminant analysis, which could be applied in process analysis to choose the correct prediction equation for a specific sugar stream product. Samples were selected based on various geographical and environmental factors to ensure variability between the samples. Two different types of data set were explored to determine the best classification model. The first method used the spectral data (absorbance against wavelength) of each sample. Pre-processing was carried out to eliminate scattering effects. Principal component analysis (PCA) was then applied to reduce the data so that only the necessary information remained. Various classification models, namely K-nearest neighbour (KNN), classification tree, support vector machine (SVM), and logistic regression, were tested and validated by comparing the predicted sample types against the actual sample types. Results showed that the KNN (3) model with the Savitzky-Golay filter and three principal components (PCs) provided the best separation between the various sugar stream products. The second method used the analyte concentrations for pol (apparent sucrose content), Brix (total dissolved solids), sucrose, fructose, glucose, and ash for the various sugar stream products.
These concentrations were standardised before PCA was applied. The same classification models were applied, tested, and validated using actual samples. These results showed that the logistic regression model with two PCs performed best. The optimum models from the two investigations were then compared by evaluating their performance measures. Based on the analyte concentration data, the logistic regression (lasso) model with two PCs provided the best separation between sugar stream products. This was determined from the F1 scores and classification accuracies for the calibration and independent validation sample data sets, which were 99.4 % and 100 %, respectively. |
Description: | Submitted in fulfilment of the requirements for the Degree of Master of Applied Science in Chemistry, Durban University of Technology, 2022. |
URI: | https://hdl.handle.net/10321/4120 |
DOI: | https://doi.org/10.51415/10321/4120 |
Appears in Collections: | Theses and dissertations (Applied Sciences) |
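The abstract describes standardising the analyte concentrations and testing a K-nearest neighbour classifier with k = 3, among other models. The following is a minimal sketch of those two steps only, using the Python standard library. It is not the thesis's implementation: the PCA step and the other classifiers are omitted for brevity, the class labels (`juice`, `syrup`, `molasses`) are hypothetical, and the (Brix, pol) values are invented placeholders rather than data from the study.

```python
import math
from collections import Counter


def standardise(rows):
    """Column-wise z-scores; also returns the means and standard deviations used."""
    columns = list(zip(*rows))
    means = [sum(c) / len(c) for c in columns]
    stds = [
        math.sqrt(sum((v - m) ** 2 for v in c) / len(c)) or 1.0  # guard constant columns
        for c, m in zip(columns, means)
    ]
    scaled = [[(v - m) / s for v, m, s in zip(row, means, stds)] for row in rows]
    return scaled, means, stds


def knn_predict(train_x, train_y, query, k=3):
    """Majority vote among the k training points closest to the query (Euclidean)."""
    nearest = sorted(zip(train_x, train_y), key=lambda pair: math.dist(pair[0], query))[:k]
    return Counter(label for _, label in nearest).most_common(1)[0][0]


# Hypothetical (Brix, pol) pairs, invented for illustration only.
TRAIN = [
    [12.0, 10.0], [13.0, 11.0], [11.5, 9.5],   # "juice" (assumed label)
    [65.0, 55.0], [66.0, 56.5], [64.0, 54.0],  # "syrup" (assumed label)
    [80.0, 35.0], [81.0, 34.0], [79.0, 36.0],  # "molasses" (assumed label)
]
LABELS = ["juice"] * 3 + ["syrup"] * 3 + ["molasses"] * 3

SCALED, MEANS, STDS = standardise(TRAIN)


def classify(sample):
    """Scale a new sample with the training statistics, then apply 3-NN."""
    scaled = [(v - m) / s for v, m, s in zip(sample, MEANS, STDS)]
    return knn_predict(SCALED, LABELS, scaled, k=3)


print(classify([12.5, 10.5]))
```

New samples are scaled with the means and standard deviations of the calibration set, not their own, so that calibration and validation data share one coordinate system before distances are compared.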
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Nadar_R_ 2022.pdf | Thesis | 11.17 MB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.