Por favor, use este identificador para citar o enlazar este ítem:
https://repositorio.uti.edu.ec//handle/123456789/5356
Registro completo de metadatos
Campo DC | Valor | Lengua/Idioma |
---|---|---|
dc.contributor.author | Santos, Fabián | - |
dc.contributor.author | Acosta, Nicole | - |
dc.date.accessioned | 2023-06-12T14:49:24Z | - |
dc.date.available | 2023-06-12T14:49:24Z | - |
dc.date.issued | 2023 | - |
dc.identifier.uri | https://www.mdpi.com/2077-0472/13/5/1015 | - |
dc.identifier.uri | https://repositorio.uti.edu.ec//handle/123456789/5356 | - |
dc.description.abstract | Ensuring food security requires the publication of data in a timely manner, but often this information is not properly documented and evaluated. Therefore, the combination of databases from multiple sources is a common practice to curate the data and corroborate the results; however, this also results in incomplete cases. These tasks are often labor-intensive since they require a case-wise review to obtain the requested and completed information. To address these problems, an approach based on Selenium web-scraping software and the multiple imputation denoising autoencoders (MIDAS) algorithm is presented for a case study in Ecuador. The objective was to produce a multidimensional database, free of data gaps, with 72 species of food crops based on the data from 3 different open data web databases. This methodology resulted in an analysis-ready dataset with 43 parameters describing plant traits, nutritional composition, and planted areas of food crops, whose imputed data obtained an R-square of 0.84 for a control numerical parameter selected for validation. This enriched dataset was later clustered with K-means to report unprecedented insights into food crops cultivated in Ecuador. The methodology is useful for users who need to collect and curate data from different sources in a semi-automatic fashion. | es |
dc.language.iso | eng | es |
dc.publisher | Agriculture (Switzerland). Volume 13, Issue 5 | es |
dc.rights | openAccess | es |
dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ | es |
dc.title | An Approach Based on Web Scraping and Denoising Encoders to Curate Food Security Datasets | es |
dc.type | article | es |
Aparece en las colecciones: | Artículos Científicos Indexados |
Ficheros en este ítem:
No hay ficheros asociados a este ítem.
Este ítem está sujeto a una licencia Creative Commons Licencia Creative Commons