Show simple item record

dc.contributor.authorMercado, Rocío
dc.contributor.authorKearnes, Steven M
dc.contributor.authorColey, Connor W
dc.date.accessioned2025-02-07T20:08:06Z
dc.date.available2025-02-07T20:08:06Z
dc.date.issued2023-07-24
dc.identifier.urihttps://hdl-handle-net.ezproxyberklee.flo.org/1721.1/158183
dc.description.abstractThe past decade has seen a number of impressive developments in predictive chemistry and reaction informatics driven by machine learning applications to computer-aided synthesis planning. While many of these developments have been made even with relatively small, bespoke data sets, in order to advance the role of AI in the field at scale, there must be significant improvements in the reporting of reaction data. Currently, the majority of publicly available data is reported in an unstructured format and heavily imbalanced toward high-yielding reactions, which influences the types of models that can be successfully trained. In this Perspective, we analyze several data curation and sharing initiatives that have seen success in chemistry and molecular biology. We discuss several factors that have contributed to their success and how we can take lessons from these case studies and apply them to reaction data. Finally, we spotlight the Open Reaction Database and summarize key actions the community can take toward making reaction data more findable, accessible, interoperable, and reusable (FAIR), including the use of mandates from funding agencies and publishers.en_US
dc.language.isoen
dc.publisherAmerican Chemical Societyen_US
dc.relation.isversionof10.1021/acs.jcim.3c00607en_US
dc.rightsCreative Commons Attribution-NonCommercial-NoDerivativesen_US
dc.rights.urihttps://creativecommons.org/licenses/by-nc-nd/4.0/en_US
dc.sourceAmerican Chemical Societyen_US
dc.titleData Sharing in Chemistry: Lessons Learned and a Case for Mandating Structured Reaction Dataen_US
dc.typeArticleen_US
dc.identifier.citationRocío Mercado, Steven M. Kearnes, and Connor W. Cole. Journal of Chemical Information and Modeling 2023 63 (14), 4253-4265.en_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Chemical Engineeringen_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Scienceen_US
dc.relation.journalJournal of Chemical Information and Modelingen_US
dc.eprint.versionFinal published versionen_US
dc.type.urihttp://purl.org/eprint/type/JournalArticleen_US
eprint.statushttp://purl.org/eprint/status/PeerRevieweden_US
dc.date.updated2025-02-07T19:54:15Z
dspace.orderedauthorsMercado, R; Kearnes, SM; Coley, CWen_US
dspace.date.submission2025-02-07T19:54:17Z
mit.journal.volume63en_US
mit.journal.issue14en_US
mit.licensePUBLISHER_CC
mit.metadata.statusAuthority Work and Publication Information Neededen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record