Title | A database framework for rapid screening of structure-function relationships in PFAS chemistry |
---|---|
ID_Doc | 65 |
Authors | Su, A; Rajan, K |
Published | Scientific Data, 8, 1 |
Structure | The article describes a database framework, PFAS-Map, that enables rapid screening of structure-function relationships in PFAS (Per- and Polyfluoroalkyl Substances) chemistry. The framework maps high-dimensional information associated with the SMILES approach of encoding molecular structure with functionality data, including bioactivity and physicochemical properties. PFAS-Map is a 3D unsupervised visualization tool that can automatically classify new PFAS chemistries based on current PFAS classification criteria. The article introduces PFAS, a class of chemicals with outstanding qualities in chemical and thermal stability, water repellency, and oil repellency, which have been used in various industrial and commercial products. However, the presence of PFASs in freshwater systems, wildlife, and even human blood has raised serious public concerns about unknown dangers due to PFAS's high persistence, bioaccumulation potential, toxicity, and ease of being transmitted or transported through the environment. The article highlights the need for a database framework that can rapidly explore systematics in structure-function relationships associated with new and emerging PFAS chemistries. The article describes the methods used to develop the PFAS-Map framework, which includes SMILES standardization, descriptors calculation, PFAS structure classification, principal component analysis (PCA), and t-distributed stochastic neighbor embedding (t-SNE) visualization. The framework uses PaDEL-descriptors software to calculate molecular descriptors and fingerprints of the chemical structures, and RDKit to standardize SMILES from different sources. The article presents the results of using the PFAS-Map framework to classify PFAS substances, including the prediction and estimation of yet unmeasured fundamental physical properties of PFAS chemistries, uncovering hierarchical characteristics in existing classification schemes, and the fusion of data from diverse sources. The framework is also used to screen the relationship between PFAS structure and toxicity from two sets of experimental data. The article discusses the utility of the PFAS-Map framework, including its ability to detect and visualize sub-classifications of PFAS chemistry, screen the relationship between PFAS structure and toxicity, and predict the structure-function relationships of new PFAS chemistries. The framework is also shown to be versatile, allowing for the visualization of classification patterns and trends in structures-function relationships in PFAS chemistry. The article provides information on the code and data availability, including the availability of the PFAS-Map framework, datasets, and data pre-processing code. The framework is available on figshare, and the datasets and data pre-processing code are also available on figshare. The article concludes that the PFAS-Map framework is a useful tool for rapidly screening structure-function relationships in PFAS chemistry, and that it has the potential to be widely used in the field of PFAS research. The framework is also shown to be versatile, allowing for the visualization of classification patterns and trends in structures-function relationships in PFAS chemistry. The article introduces PFAS, a class of chemicals with outstanding qualities in chemical and thermal stability, water repellency, and oil repellency, which have been used in various industrial and commercial products. However, the presence of PFASs in freshwater systems, wildlife, and even human blood has raised serious public concerns about unknown dangers due to PFAS's high persistence, bioaccumulation potential, toxicity, and ease of being transmitted or transported through the environment. The article describes the methods used to develop the PFAS-Map framework, which includes SMILES standardization, descriptors calculation, PFAS structure classification, principal component analysis (PCA), and t-distributed stochastic neighbor embedding (t-SNE) visualization. The framework uses PaDEL-descriptors software to calculate molecular descriptors and fingerprints of the chemical structures, and RDKit to standardize SMILES from different sources. The article presents the results of using the PFAS-Map framework to classify PFAS substances, including the prediction and estimation of yet unmeasured fundamental physical properties of PFAS chemistries, uncovering hierarchical characteristics in existing classification schemes, and the fusion of data from diverse sources. The framework is also used to screen the relationship between PFAS structure and toxicity from two sets of experimental data. The article discusses the utility of the PFAS-Map framework, including its ability to detect and visualize sub-classifications of PFAS chemistry, screen the relationship between PFAS structure and toxicity, and predict the structure-function relationships of new PFAS chemistries. The framework is also shown to be versatile, allowing for the visualization of classification patterns and trends in structures-function relationships in PFAS chemistry. The article provides information on the code and data availability, including the availability of the PFAS-Map framework, datasets, and data pre-processing code. The framework is available on figshare, and the datasets and data pre-processing code are also available on figshare. The article concludes that the PFAS-Map framework is a useful tool for rapidly screening structure-function relationships in PFAS chemistry, and that it has the potential to be widely used in the field of PFAS research. The framework is also shown to be versatile, allowing for the visualization of classification patterns and trends in structures-function relationships in PFAS chemistry. |
Summary | The authors have developed a database framework, called PFAS-Map, to rapidly explore structure-function relationships in Per- and Polyfluoroalkyl Substances (PFAS) chemistry. PFASs are a class of compounds that have been widely used in various industrial and commercial products due to their unique properties, but have also raised concerns due to their persistence, bioaccumulation, and toxicity. The PFAS-Map uses a combination of machine learning and unsupervised learning techniques to classify PFAS compounds into different classes and subclasses based on their molecular structure and functionality. The framework uses Simplified Molecular Input Line Entry System (SMILES) format to represent the molecular structure of PFAS compounds and calculates molecular descriptors and fingerprints to capture their properties. The PFAS-Map also includes a 3D visualization tool that can automatically classify new PFAS chemistries based on current PFAS classification criteria. The framework has been trained using data from the US Environmental Protection Agency's (EPA) PFAS master list and has been validated using experimental data. The authors have demonstrated the utility of PFAS-Map in detecting and visualizing sub-classifications of PFAS chemistry, screening the relationship between PFAS structure and toxicity, and predicting the structure-function relationships of new PFAS compounds. The PFAS-Map is an open-source framework that can be used to explore the vast and growing dataset of PFAS compounds and to identify new hazards associated with these compounds. The framework has the potential to accelerate the development of new PFAS-free products and to inform regulatory decisions. The authors hope that PFAS-Map will become a widely-used tool in the scientific community to address the growing concerns about PFASs. |
Scientific Methods | The research methods used in this article are: 1. 2. 3. 4. 5. 6. 7. Overall, the authors used a range of research methods to develop a comprehensive database framework for rapid screening of structure-function relationships in PFAS chemistry. |
Article contribution | The article "A Database Framework for Rapid Screening of Structure-Function Relationships in PFAS Chemistry" by An Su and Krishna Rajan presents a novel database framework, called PFAS-Map, for rapid screening of structure-function relationships in PFAS chemistry. The framework is designed to facilitate the classification, visualization, and analysis of PFAS compounds, which are critical for understanding their potential environmental and health impacts. Contribution to Regenerative Economics: 1. 2. 3. 4. 5. In summary, the PFAS-Map framework has the potential to contribute to regenerative economics by providing a platform for analyzing the environmental impact of PFAS compounds, developing sustainable materials alternatives, promoting circular economy approaches, informing regulatory decisions, and facilitating open-source data sharing. |
No similar articles found.