2020 ScientificTextMiningandKnowledg

From GM-RKB
(Redirected from Jiang and Shang, 2020)
Jump to navigation Jump to search

Subject Headings: Scientific Text Mining.

Notes

08:00-08:10 Introduction (Shang) slides video
08:10-09:00 Mining Structures from Scientific Text (Shang): Phrase mining, etc. slides video
09:00-09:35 Extracting Information from Scientific Text (Jiang) slides video
09:35-10:05 Building Scientific Knowledge Graphs (Shang): Ontology, taxonomy, etc. slides video
10:05-10:30 Building Scientific Knowledge Graphs (Jiang): Biomedical KG, etc. slides video
10:30-10:40 Conclusions (Jiang) slides video

Cited By

Quotes

Abstract

Unstructured scientific text, in various forms of textual artifacts, including manuscripts, publications, patents, and proposals, is used to store the tremendous wealth of knowledge discovered after weeks, months, and years, developing hypotheses, working in the lab or clinic, and analyzing results. A grand challenge on data mining research is to develop effective methods for transforming the scientific text into well-structured forms (e.g., ontology, taxonomy, knowledge graphs), so that machine intelligent systems can build on them for hypothesis generation and validation. In this tutorial, we provide a comprehensive overview on recent research and development in this direction. First, we introduce a series of text mining methods that extract phrases, entities, scientific concepts, relations, claims, and experimental evidence. Then we discuss methods that construct and learn from scientific knowledge graphs for accurate search, document classification, and exploratory analysis. Specifically, we focus on scalable, effective, weakly supervised methods that work on text in sciences (e.g., chemistry, biology).

Introduction

Chapter 1 Part 1: Phrase Mining

Chapter 1 Part 2: Named Entity Recognition and Neural Language Models

Chapter 1 Part 3: Relation Extraction and Attribute Discovery

Chapter 1 Part 4: Scientific Statements

Chapter 2 Part 1: Taxonomy Construction

Chapter 2 Part 2: Knowledge Graph Learning

Conclusions

  • Methods for extracting entities (methods, research topics, technologies, tasks, materials, metrics, research contributions) and relationships from research publications
  • Methods for extracting metadata about authors, documents, datasets, grants, affiliations and others.
  • Methods for quality assessment of scientific knowledge graphs
  • Methods for the exploration, retrieval and visualization of scientific knowledge graphs
  • Scientific claims identification from textual contents • Novel user interfaces for interaction with paper, metadata, content, software and data
  • Data models (e.g., ontologies, vocabularies, schemas) for the description of scholarly data and the linking between scholarly data/software and academic papers that report or cite them
  • Automatic or semi-automatic approaches to making sense of research dynamics
  • Applications for the (semi-)automatic annotation of scholarly papers
  • Description and use of provenance information of scholarly data
  • Theoretical models describing the rhetorical and argumentative structure of scholarly papers and their application in practice
  • Novel user interfaces for interaction with paper, metadata, content, software and data
  • Visualization of related papers or data according to multiple dimensions (semantic similarity of abstracts, keywords, etc.)

References

;

 AuthorvolumeDate ValuetitletypejournaltitleUrldoinoteyear
2020 ScientificTextMiningandKnowledgMeng Jiang
Jingbo Shang
Scientific Text Mining and Knowledge Graphs2020