Natural Language Processing (NLP) System Benchmark Task

From GM-RKB
(Redirected from Benchmark NLP Task)
Jump to navigation Jump to search

A Natural Language Processing (NLP) System Benchmark Task is a AI benchmarking task that evaluates the AI system performance of NLP systems.



References

2023

  • (Wikipedia, 2023) ⇒ https://en.wikipedia.org/wiki/Natural-language_programming Retrieved:2023-11-12.
    • Natural-language programming (NLP) is an ontology-assisted way of programming in terms of natural-language sentences, e.g. English. A structured document with Content, sections and subsections for explanations of sentences forms a NLP document, which is actually a computer program. Natural language programming is not to be mixed up with natural language interfacing or voice control where a program is first written and then communicated with through natural language using an interface added on. In NLP the functionality of a program is organised only for the definition of the meaning of sentences. For instance, NLP can be used to represent all the knowledge of an autonomous robot. Having done so, its tasks can be scripted by its users so that the robot can execute them autonomously while keeping to prescribed rules of behaviour as determined by the robot's user. Such robots are called transparent robots [1] as their reasoning is transparent to users and this develops trust in robots. Natural language use and natural-language user interfaces include Inform 7, a natural programming language for making interactive fiction, Shakespeare, an esoteric natural programming language in the style of the plays of William Shakespeare, and Wolfram Alpha, a computational knowledge engine, using natural-language input. Some methods for program synthesis are based on natural-language programming. [2]
  1. Development of reliable and trustworthy robots. “transparent robots" }
  2. Desai, Aditya, et al. “Program synthesis using natural language." Proceedings of the 38th International Conference on Software Engineering. ACM, 2016.