연구성과 상세
연구성과 상세
A machine learning framework for extracting information from biological pathway images in the literature
-
- 작성일
-
- 조회수
- 3
구분 | 논문 | 연구분야명 | 유전자회로 | ||||||
---|---|---|---|---|---|---|---|---|---|
연구책임자명(주관기관) | 김현욱 | 소속기관명(주관기관) | 한국과학기술원 | ||||||
학술지명 | Metabolic Engineering | 게재년월 | 2024-09 | ||||||
ISSN | DOI | ||||||||
게재권 | 82 | 게재호 | 0 | ||||||
링크 | https://www.sciencedirect.com/science/article/pii/S1096717624001125 | ||||||||
초록 | There have been significant advances in literature mining, allowing for the extraction of target information from the literature. However, biological literature often includes biological pathway images that are difficult to extract in an easily editable format. To address this challenge, this study aims to develop a machine learning framework called the “Extraction of Biological Pathway Information” (EBPI). The framework automates the search for relevant publications, extracts biological pathway information from images within the literature, including genes, enzymes, and metabolites, and generates the output in a tabular format. For this, this framework determines the direction of biochemical reactions, and detects and classifies texts within biological pathway images. Performance of EBPI was evaluated by comparing the extracted pathway information with manually curated pathway maps. EBPI will be useful for extracting biological pathway information from the literature in a high-throughput manner, and can be used for pathway studies, including metabolic engineering. |