Knowledge Base (KB)
A Knowledge Base (KB) is a data base of knowledge item records (such as KB concept records and KB relation records).
- AKA: Semantic Inventory/Resource, Body of Knowledge.
- Context:
- It can have one or more InstanceOf Semantic Relation References.
- It can have a Formal Specification.
- It can abide by some Knowledge Representation Model.
- It can support a KB-based Task (e.g. KB-based QA).
- It can approximate the capability of Human Memory.
- It can be managed by a Knowledge Base Management Task.
- It can be designed by a Knowledge Base Design Task.
- It can be populated by Knowledge Base Population Task.
- It can be managed by a Knowledge Base Management Task.
- It can range from being an Open KB (such as a Web KB) to being a Proprietary KB.
- It can range from being a Domain-Specific KB to being a Top-Level KB to being an Encyclopedic KB.
- It can range from being a Human-Processable KB to being a Machine-Processable KB.
- It can range from being a Manually Created KB to being an Automatically Created KB.
- It can range from being a Personal Knowledge Base to being an Organizational Knowledge Base.
- It can range from being an Offline Knowledge Base to being an Online Knowledge Base.
- It can range from being a Small KB to being a Medium KB to being a Large KB.
- It can range from being an Exact KB to being a Probabilistic KB.
- It can range from being a High-Quality KB to being a Low-Quality KB.
- It can range from being a Shallow KB (such as a knowledge graph), to being a Formal Knowledge base (such as a first-order KB).
- ...
- Example(s):
- Counter-Example(s):
- a Corpus.
- an Expert System.
- See: Information, Entity Database, Knowledge Resource, Belief System, Information Storage, Structured Data, Unstructured Information, Expert Systems, Knowledge-Based Systems, Inference Engine, Management Information Systems.
References
2016
- (Wikipedia, 2016) ⇒ http://en.wikipedia.org/wiki/knowledge_base Retrieved:2016-1-18.
- A knowledge base (KB) is a technology used to store complex structured and unstructured information used by a computer system. The initial use of the term was in connection with expert systems which were the first knowledge-based systems.
The original use of the term knowledge-base was to describe one of the two sub-systems of a knowledge-based system. A knowledge-based system consists of a knowledge-base that represents facts about the world and an inference engine that can reason about those facts and use rules and other forms of logic to deduce new facts or highlight inconsistencies.
The term "knowledge-base" was to distinguish from the more common widely used term database. At the time (the 1970s) virtually all large Management Information Systems stored their data in some type of hierarchical or relational database. At this point in the history of Information Technology the distinction between a database and a knowledge-base was clear and unambiguous. A database had the following properties:
- Flat data. Data was usually represented in a tabular format with strings or number in each field.
- Multiple users. A conventional database must support more than one user or system logged into the same data at the same time.
- Transactions. An essential requirement for a database was to maintain integrity and consistency among data that is accessed by concurrent users. These are the so-called ACID properties: Atomicity, Consistency, Isolation, and Durability.
- Large, long-lived data. A corporate database needed to support not just thousands but hundreds of thousands or beyond rows of data. Such a database usually needs to persist past the specific uses of any individual program, it needs to store data for years and decades rather than for the life of a program.
- The first knowledge-based systems had data needs that were the opposite of these database requirements. An expert system requires structured data. Not just tables with numbers and strings but pointers to other objects which in turn have additional pointers. The ideal representation for a knowledge-base is an object model (often called an ontology in AI literature) with classes, subclasses, and instances.
Early expert systems also had little need for multiple users or the complexity that comes with requiring transactional properties on data. The data for the early expert systems was used to arrive at a specific answer, a medical diagnosis, the design of a molecule, or a response to an emergency. Once the solution to the problem was known there was not a critical demand to store large amounts of data back to a permanent memory store. A more precise statement would be that given the technologies available researchers compromised and did without these capabilities because they realized they were beyond what could be expected and they could develop useful solutions to non-trivial problems without them. Even from the beginning the more astute researchers realized the potential benefits of being able to store, analyze, and reuse knowledge. For example, see the discussion of Corporate Memory in the earliest work of the Knowledge-Based Software Assistant program by Cordell Green et al. The volume requirements were also different for a knowledge-base compared to a conventional database. The knowledge-base needed to know facts about the world. For example, to represent the statement that "All humans are mortal". A database typically could not represent this general knowledge but instead would need to store information about thousands of tables that represented information about specific humans. Representing that all humans are mortal and being able to reason about any given human that they are mortal is the work of a knowledge-base. Representing that George, Mary, Sam, Jenna, Mike,... and hundreds of thousands of other customers are all humans with specific ages, sex, address, etc. is the work for a database. As expert systems moved from being prototypes to systems deployed in corporate environments the requirements for their data storage rapidly started to overlap with the standard database requirements for multiple, distributed users with support for transactions. Initially, the demand could be seen in two different but competitive markets. From the AI and Object-Oriented communities object-oriented databases such as Versant emerged. These were systems designed from the ground up to have support for object-oriented capabilities but also to support standard database services as well. On the other hand, the large database vendors such as Oracle added capabilities to their products that provided support for knowledge-base requirements such as class-subclass relations and rules. The next evolution for the term knowledge-base was the Internet. With the rise of the Internet documents, hypertext, and multimedia support were now critical for any corporate database. It was no longer enough to support large tables of data or relatively small objects that lived primarily in computer memory. Support for corporate web sites required persistence and transactions for documents. This created a whole new discipline known as Web Content Management. The other driver for document support was the rise of knowledge management vendors such as Lotus Notes. Knowledge Management actually predated the Internet but with the Internet there was great synergy between the two areas. Knowledge management products adopted the term "knowledge-base" to describe their repositories but the meaning had a subtle difference. In the case of previous knowledge-based systems the knowledge was primarily for the use of an automated system, to reason about and draw conclusions about the world. With knowledge management products the knowledge was primarily meant for humans, for example to serve as a repository of manuals, procedures, policies, best practices, reusable designs and code, etc. Of course in both cases the distinctions between the uses and kinds of systems were ill defined. As the technology scaled up it was rare to find a system that could really be cleanly classified as knowledge-based in the sense of an expert system that performed automated reasoning and knowledge-based in the sense of knowledge management that provided knowledge in the form of documents and media that could be leveraged by humans.
- A knowledge base (KB) is a technology used to store complex structured and unstructured information used by a computer system. The initial use of the term was in connection with expert systems which were the first knowledge-based systems.
2009
- http://en.wiktionary.org/wiki/knowledge_base
- A knowledge base (or knowledgebase; abbreviated KB, kb or Δ[citation needed]) is a special kind of database for knowledge management. It provides the means for the computerized collection, organization, and retrieval of knowledge.
- Machine-readable knowledge bases store knowledge in a computer-readable form, usually for the purpose of having automated deductive reasoning applied to them. They contain a set of data, often in the form of rules that describe the knowledge in a logically consistent manner. An ontology can define the structure of stored data - what types of entities are recorded and what their relationships are. Logical operators, such as And (conjunction), Or (disjunction), material implication and negation may be used to build it up from simpler pieces of information. Consequently, classical deduction can be used to reason about the knowledge in the knowledge base. Some machine-readable knowledge bases are used with artificial intelligence, for example as part of an expert system that focuses on a domain like prescription drugs or customs law. Such knowledge bases are also used by the semantic web.
- Human-readable knowledge bases are designed to allow people to retrieve and use the knowledge they contain. They are commonly used to complement a help desk or for sharing information among employees within an organization. They might store troubleshooting information, articles, white papers, user manuals, or answers to frequently asked questions. Typically, a search engine is used to locate information in the system, or users may browse through a classification scheme.
2009b
- amsglossary.allenpress.com/glossary/browse
- knowledge base: The facts, relationships, and procedures that constitute the knowledge about a given domain or task; the database of an expert (or knowledge based) system.
- www.ichnet.org/glossary.htm
- knowledge base: A store of knowledge about a domain represented in machine-processable form, which may be rules (in which case the knowledge base may be ...
- www.noisebetweenstations.com/personal/essays/metadata_glossary/metadata_glossary.html
- knowledge base: An ontology populated with data.
- cordis.europa.eu/ist/ka1/administrations/publications/glossary.htm
- knowledge base: A collection of stored facts, heuristics and models that can be used for problem solving.
- CYC Glossary http://www.cyc.com/cycdoc/ref/glossary.html
- knowledge base (KB): The CYC® KB is the repository of Cyc's knowledge. It consists of a large number of FORTs and an even larger number of assertions involving those constants.
2007a
- (Obitko, 2007) ⇒ Marek Obitko. (2007). “Translations between Ontologies in Multi-Agent Systems", Ph.D. dissertation, Faculty of Electrical Engineering, Czech Technical University in Prague. http://obitko.com/tutorials/ontologies-semantic-web/ontologies.html
- Ontology describes a domain, while a knowledge base (based on an ontology) describes particular state of affairs. Each knowledge based system or agent has its own knowledge base, and only what can be expressed using an ontology can be stored and used in the knowledge base. When an agent wants to communicate to another agent, he uses the constructs from some ontology. In order to understand in communication, ontologies must be shared between agents.
2007b
- (Obitko, 2007) ⇒ Marek Obitko. (2007). “Translations between Ontologies in Multi-Agent Systems", Ph.D. dissertation, Faculty of Electrical Engineering, Czech Technical University in Prague. http://obitko.com/tutorials/ontologies-semantic-web/specification-of-conceptualization.html
- The representation of a body of knowledge (knowledge base) is based on the specification of conceptualization. A conceptualization is an abstract, simplified view of the world that we wish to represent for some purpose. Every knowledge base, knowledge-based system or agent is committed to some conceptualization, explicitly or implicitly. For these systems, what "exists" is what can be represented. When the knowledge of a domain is represented in a declarative formalism, the set of objects that can be represented is called the universe of discourse. This set of objects and the describable relationships among them are reflected in the representational vocabulary with which a knowledge-based program represents knowledge. Thus, in the context of AI, we can describe the ontology of a program by defining a set of representational terms. In such an ontology, definitions associate the names of entities in the universe of discourse (e.g. classes, relations, functions, or other objects) with descriptions of what the names mean, and formal axioms that constrain the interpretation and well-formed use of these terms. Formally it can be said that an ontology is a statement of a logical theory.
2007c
- (Obitko, 2007) ⇒ Marek Obitko. (2007). “Translations between Ontologies in Multi-Agent Systems", Ph.D. dissertation, Faculty of Electrical Engineering, Czech Technical University in Prague. http://obitko.com/tutorials/ontologies-semantic-web/body-of-knowledge.html
- Sometimes, ontology is defined as a body of knowledge describing some domain, typically a common sense knowledge domain, using a representation vocabulary as described above. In this case, an ontology is not only the vocabulary, but the whole "upper" knowledge base (including the vocabulary that is used to describe this knowledge base).
The typical example is the project CYC that defines its knowledge base as an ontology for any other knowledge based system. CYC is the name of a very large, multi-contextual knowledge base and inference engine. CYC is an early attempt to do symbolic AI on a massive scaleby capturing common knowledge that is required to do tasks that are trivial for people, but very hard for computers. All of the knowledge in CYC is represented declaratively in the form of logical assertions. CYC contains over 400,000 significant assertions, which include simple statements of facts, rules about what conclusions to draw if certain statements of facts are satisfied, and rules about how to reason with certain types of facts and rules. New conclusions are derived by the inference engine using deductive reasoning. The CYC common sense knowledge can be used as a foundation of a knowledge base for any knowledge intensive system. In this sense, this body of knowledge can be viewed as an ontology of the knowledge base of the system.
- Sometimes, ontology is defined as a body of knowledge describing some domain, typically a common sense knowledge domain, using a representation vocabulary as described above. In this case, an ontology is not only the vocabulary, but the whole "upper" knowledge base (including the vocabulary that is used to describe this knowledge base).
[[Category:Quality Silver ]]