<rdf:RDF xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:dct="http://purl.org/dc/terms/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
  <rdf:Description rdf:about="https://doi.org/10.1016/j.envpol.2021.118128">
    <dct:isReferencedBy>OPENAIRE</dct:isReferencedBy>
    <dct:isReferencedBy>OpenAire</dct:isReferencedBy>
    <dct:isReferencedBy>Datacite</dct:isReferencedBy>
    <dct:isReferencedBy>Sygma</dct:isReferencedBy>
    <dct:isReferencedBy>Ghent University Academic Bibliography</dct:isReferencedBy>
    <dct:isReferencedBy>ZENODO</dct:isReferencedBy>
    <dct:isReferencedBy>Crossref</dct:isReferencedBy>
    <dct:isReferencedBy>Microsoft Academic Graph</dct:isReferencedBy>
    <dct:isReferencedBy>Europe PubMed Central</dct:isReferencedBy>
    <dct:isReferencedBy>European Union Open Data Portal</dct:isReferencedBy>
    <dct:isPartOf>Environmental Pollution</dct:isPartOf>
    <dct:license>Open Access</dct:license>
    <dct:created>2021-09-09</dct:created>
    <dc:description>Previous studies have mostly focused on using visible-to-near-infrared spectral technique to quantitatively estimate soil cadmium (Cd) content, whereas little attention has been paid to identifying soil Cd contamination from a perspective of spectral classification. Here, we developed a framework to compare the potential of two spectral transformations (i.e., raw reflectance and continuum removal [CR]), three optimization strategies (i.e., full-spectrum, Boruta feature selection, and synthetic minority over-sampling technique [SMOTE]), and three classification algorithms (i.e., partial least squares discriminant analysis, random forest [RF], and support vector machine) for diagnosing soil Cd contamination. A total of 536 soil samples were collected from urban and suburban areas located in Wuhan City, China. Specifically, Boruta and SMOTE strategies were aimed at selecting the most informative predictors and obtaining balanced training datasets, respectively. Results indicated that soils contaminated by Cd induced decrease in spectral reflectance magnitude. Classification models developed after Boruta and SMOTE strategies out-performed to those from full-spectrum. A diagnose model combining CR preprocessing, SMOTE strategy, and RF algorithm achieved the highest validation accuracy for soil Cd (Kappa = 0.74). This study provides a theoretical reference for rapid identification of and monitoring of soil Cd contamination in urban and suburban areas.</dc:description>
    <dc:subject>DIFFUSE-REFLECTANCE SPECTROSCOPY</dc:subject>
    <dc:subject>HUMAN HEALTH</dc:subject>
    <dc:subject>PREDICTION</dc:subject>
    <dc:subject>POTENTIALLY TOXIC ELEMENTS</dc:subject>
    <dc:subject>Boruta algorithm</dc:subject>
    <dc:subject>01 natural sciences</dc:subject>
    <dc:subject>Visible-to-near-infrared spectroscopy</dc:subject>
    <dc:subject>NIR SPECTROSCOPY</dc:subject>
    <dc:subject>Soil</dc:subject>
    <dc:subject>ORGANIC-CARBON</dc:subject>
    <dc:subject>Machine learning</dc:subject>
    <dc:subject>11. Sustainability</dc:subject>
    <dc:subject>Soil Pollutants</dc:subject>
    <dc:subject>Least-Squares Analysis</dc:subject>
    <dc:subject>0105 earth and related environmental sciences</dc:subject>
    <dc:subject>Spectroscopy, Near-Infrared</dc:subject>
    <dc:subject>RANDOM FOREST</dc:subject>
    <dc:subject>Urban and suburban soil Cd contamination</dc:subject>
    <dc:subject>04 agricultural and veterinary sciences</dc:subject>
    <dc:subject>15. Life on land</dc:subject>
    <dc:subject>QUANTITATIVE-ANALYSIS</dc:subject>
    <dc:subject>6. Clean water</dc:subject>
    <dc:subject>RIVER DELTA</dc:subject>
    <dc:subject>13. Climate action</dc:subject>
    <dc:subject>Earth and Environmental Sciences</dc:subject>
    <dc:subject>Synthetic minority over-sampling technique</dc:subject>
    <dc:subject>0401 agriculture, forestry, and fisheries</dc:subject>
    <dc:subject>HEAVY-METAL CONCENTRATIONS</dc:subject>
    <dc:subject>Cadmium</dc:subject>
    <dc:creator rdf:resource="https://orcid.org/0000-0003-3914-5402"/>
    <dc:creator rdf:resource="https://orcid.org/0000-0003-1245-0482"/>
    <dc:creator rdf:resource="https://orcid.org/0000-0002-7442-3239"/>
    <dc:creator rdf:resource="https://orcid.org/0000-0001-5731-5036"/>
    <dc:creator>Hang Cheng, Ruili Shen, Zhou Shi, Zushuai Wei, Long Guo, Songchao Chen, Abdul Mounem Mouazen, Yaolin Liu, Jian Yang, Yiyun Chen, Gang Xu, Yongsheng Hong, Yongsheng Hong, </dc:creator>
    <dc:date>2021-12-01</dc:date>
    <dc:type>journalpaper</dc:type>
    <dct:abstract>Previous studies have mostly focused on using visible-to-near-infrared spectral technique to quantitatively estimate soil cadmium (Cd) content, whereas little attention has been paid to identifying soil Cd contamination from a perspective of spectral classification. Here, we developed a framework to compare the potential of two spectral transformations (i.e., raw reflectance and continuum removal [CR]), three optimization strategies (i.e., full-spectrum, Boruta feature selection, and synthetic minority over-sampling technique [SMOTE]), and three classification algorithms (i.e., partial least squares discriminant analysis, random forest [RF], and support vector machine) for diagnosing soil Cd contamination. A total of 536 soil samples were collected from urban and suburban areas located in Wuhan City, China. Specifically, Boruta and SMOTE strategies were aimed at selecting the most informative predictors and obtaining balanced training datasets, respectively. Results indicated that soils contaminated by Cd induced decrease in spectral reflectance magnitude. Classification models developed after Boruta and SMOTE strategies out-performed to those from full-spectrum. A diagnose model combining CR preprocessing, SMOTE strategy, and RF algorithm achieved the highest validation accuracy for soil Cd (Kappa = 0.74). This study provides a theoretical reference for rapid identification of and monitoring of soil Cd contamination in urban and suburban areas.</dct:abstract>
    <dc:title>Diagnosis of cadmium contamination in urban and suburban soils using visible-to-near-infrared spectroscopy</dc:title>
    <dc:identifier>10.1016/j.envpol.2021.118128</dc:identifier>
    <dct:references>https://doi.org/10.1016/j.envpol.2021.118128</dct:references>
    <dct:relation>818346</dct:relation>
  </rdf:Description>
</rdf:RDF>