Open Calais Web Service
(Redirected from Calais Semantic Metadata Service)
Jump to navigation
Jump to search
A Open Calais Web Service is a web service that creates semantic metadata from unstructured documents.
- AKA: Open Calais, Calais, Calais Semantic Metadata Service.
- Context:
- It is solved by Calais System.
- It is based on natural language processing and machine learning methods.
- It identifies entities, facts and events within the text.
- See: Calais System, PermID, Semantic Web, Resource Description Framework, Blog, Web Service, Thomson Reuters, Semantic, Unstructured Data, ClearForest, Tag (Metadata), Natural Language Processing.
References
2017a
- (Open Calais, 2017) ⇒ "Thomson Reuters Open Calais API User Guide", http://www.opencalais.com/wp-content/uploads/folder/Thomson-Reuters-Open-Calais-API-User-GuideR10_6.pdf
- Open Calais is a sophisticated Thomson Reuters web service that attaches intelligent metadata-tags to your unstructured content, enabling powerful text analytics. The Open Calais natural language processing engine automatically analyzes and tags your input files in such a way that your consuming application can both easily pinpoint relevant data, and effectively leverage the invaluable intelligence and insights contained within the text. Open Calais analyzes the semantic content of your input files using a combination of statistical, machine-learning, and custom pattern-based methods. Developed by the Text Metadata Services (TMS) group at Thomson Reuters, Open Calais outputs highly accurate and detailed metadata.
- Open Calais also maps your metadata-tags to Thomson Reuters unique IDs. This supports disambiguation (and linking) of data across all documents processed by Open Calais, and also offers you the opportunity to further enrich your data with related information from the Thomson Reuters datasets.
2017b
- (Wikipedia, 2017) ⇒ https://en.wikipedia.org/wiki/Calais_(Reuters_product) Retrieved:2017-5-28.
- Calais is a service by Thomson Reuters that automatically extracts semantic information from web pages in a format that can be used on the semantic web. [1] Calais was launched in January 2008, and is free to use. [2] [3] The Calais Web service reads unstructured text and returns Resource Description Framework formatted results identifying entities, facts and events within the text. [4] The service appears to be based on technology acquired when Reuters purchased ClearForest in 2007. The technology has also been used to automatically tag blog articles [5] and organize museum collections. [6]
Calais uses natural language processing technologies delivered via a web service interface.
- Calais is a service by Thomson Reuters that automatically extracts semantic information from web pages in a format that can be used on the semantic web. [1] Calais was launched in January 2008, and is free to use. [2] [3] The Calais Web service reads unstructured text and returns Resource Description Framework formatted results identifying entities, facts and events within the text. [4] The service appears to be based on technology acquired when Reuters purchased ClearForest in 2007. The technology has also been used to automatically tag blog articles [5] and organize museum collections. [6]
2017c
- (W3, 2017) ⇒ https://www.w3.org/2001/sw/wiki/Open_Calais Retrieved 2017-06-28
- Open Calais from Reuters is a web service that automatically attaches rich semantic metadata to the content you submit. Using natural language processing, machine learning and other methods, Calais categorizes and links your document with entities (people, places, organizations, etc.), facts (person ‘x’ works for company ‘y’), and events (person ‘z’ was appointed chairman of company ‘y’ on date ‘x’). The metadata results are stored centrally and returned as RDF constructs.
2008
- (Drupal, 2008) ⇒ Calais: Semantic Metadata Tagging for your Nodes, https://www.drupal.org/node/303763
- Overview: These modules are an integration of the Thomson Reuters' Calais web service into the Drupal platform. The Calais Web Service automatically creates rich semantic metadata for the content you submit – in well under a second. Using natural language processing, machine learning and other methods, Calais analyzes your document and finds the entities within it. But, Calais goes well beyond classic entity identification and returns the facts and events hidden within your text as well. The web service is free for commercial and non-commercial use, but reguires registration to obtain an API Key
- At its core, these modules allow you to automatically tag your data. However, the Calais Web Service allows it to be taken a step further by not only identifying the terms in your content, but also identifying the context and relevancy of those terms. Many services can tell you that IBM was mentioned in your content, but no other service identifies that IBM is an Organization, then disambiguates all the various references to IBM (International Business Machines, etc.), and finally tells you with a scoring mechanism that your content is more about IBM than any other term identified. A truly innovative service, powered by incredible technology and extremely dedicated people.