US20140303963A1 - Natural language parsing method to provide conceptual flow - Google Patents


Info

Publication number
US20140303963A1
US20140303963A1 (Application US 14/273,143)
Authority
US
United States
Prior art keywords
neuron
clump
concept
sentence
verb
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US14/273,143
Inventor
Lukas K. WOMACK
Thomas A. Visel
Matthew Fettig
Hannah LINDSLEY
Micah LEDBETTER
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Neuric LLC
Original Assignee
Neuric LLC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US11/154,313 (now US7089218B1)
Priority claimed from US11/670,959 (published as US20070156625A1)
Priority claimed from US11/697,721 (now US8001067B2)
Priority claimed from US11/758,667 (now US7925492B2)
Application filed by Neuric LLC filed Critical Neuric LLC
Priority to US14/273,143
Publication of US20140303963A1


Classifications

    • G06F17/2705
    • G06F17/28
    • G06F40/00 Handling natural language data
    • G06F40/20 Natural language analysis
    • G06F40/205 Parsing
    • G06F40/211 Syntactic parsing, e.g. based on context-free grammar [CFG] or unification grammars
    • G06F40/30 Semantic analysis
    • G06F40/40 Processing or translation of natural language

Definitions

  • the present invention pertains in general to artificial brains and, more particularly, to the aspect of storing information as a plurality of individual concepts that are capable of being connected to each other through predetermined relationships.
  • a first principles system, which defines the artificial brain in the form of a plurality of rules and various hierarchical structures.
  • some linguistics systems utilize a database comprised of a plurality of linguistic rules that are used to parse a sentence into its individual parts, such as subject, verb and object and then organize sentences as groups of components into paragraphs and the paragraphs into an overall outline. Some of these systems have rules to determine the concepts behind particular sentences and paragraphs, and even generate an overall outline.
  • a second type of system utilizes a neural network which is comprised of a plurality of weighted nodes that are interconnected with input and output nodes.
  • activation nodes are trained to provide an overall representation of a system. These are non-linear networks and each node has a function that is determined by training and the like. No one given node has a predefined relationship. Any type of nodal system utilized to represent a series of complex relationships typically requires a plurality of individual nodes, each defined in terms of activation levels, such that the “firing” of a particular node based upon an input or an event will cause the firing of sequential nodes to represent an overall concept. The use of such nodes does not lend itself to defining concepts in a discrete manner or to interconnecting such concepts in an adequate manner.
  • the present invention disclosed and claimed herein comprises a method for parsing the flow of natural human language to convert a flow of machine recognizable language into a conceptual flow.
  • the lexical structure is recognized and then, a basic semantic grouping is determined for the language flow in the lexical structure.
  • the basic semantic grouping is then determined that denotes the main action, occurrence or state of being for the language flow.
  • the responsibility of the main action, occurrence or state of being for the language flow is then determined within the lexical structure followed by semantically parsing the lexical structure. Thereafter, any ambiguities in the responsibilities are resolved in a recursive manner by applying a predetermined set of rules thereto.
  • FIG. 1 illustrates an overall block diagram of a general artificial brain and the associated neuron database
  • FIG. 2 illustrates the concept organization
  • FIG. 3 illustrates a diagrammatic view of one concept flow
  • FIG. 4 illustrates a block diagram of a plurality of neurons showing the interconnection thereof through relationships
  • FIG. 5 illustrates a diagrammatic view of various relationship flows defined as clumps
  • FIG. 6 illustrates an alternate diagrammatic view of the organization of clumps
  • FIG. 7 illustrates a block diagram of the general overall artificial brain and the operation of receiving a textual input and providing an output
  • FIG. 8 illustrates a diagrammatic view of the data fields for a relational neuron
  • FIG. 9 illustrates a nodal array for one conceptual flow
  • FIG. 10 illustrates a detail of the nodal array of FIG. 9 ;
  • FIG. 11 illustrates a diagrammatic view of interconnections between neurons
  • FIG. 12 illustrates a diagrammatic view of the parser
  • FIG. 12 a illustrates a diagram of the system linkage to clumps
  • FIGS. 13-18 are tables illustrating clump definitions.
  • FIG. 1 there is illustrated a general block diagram of the processing system that represents the artificial brain.
  • the brain processing system is referred to by a reference numeral 102 . It is operable to receive an input on an input line 104 and provide an output on an output node 106 .
  • the system will have a discourse with a user. The user will input text in the form of a query (or input voice that is translated to text or some similar machine recognizable form).
  • the input may be a question that may ask the system to “show me the tree,” which is then interpreted by the system to possibly display a picture of a tree. It may be that the system detects some type of ambiguity in that it does not understand whether the query is trying to ask it about a plant or a graph, as the term “tree” could be the plant defined as a tree or it could be a graph that is in the form of a tree structure. In this situation, the output would be a question to the user asking whether the user means the plant or the graph. Another input would then be received indicating that it was the plant. The system would then process this further.
  • the brain processing system 102 interfaces with a nodal array 104 comprised of a plurality of individual neuron nodes, hereinafter referred to as “neurons.”
  • the nodal array 104 is operable to store in each of the plurality of neurons a unique concept.
  • This concept can relate to a hard object or an abstract idea.
  • the concept could be the concept of a tree in a plant based context. It could also be the concept of a “truck.” Further, it could be related to the concept of a “fire truck.” Any complex number of words could define a particular concept and this particular concept does not necessarily require that it be a single word but, rather, it can be a combination of words and could even include the concept embodied in a phrase.
  • each concept has some type of relationship with other concepts, when taken in the context of a sentence.
  • a plurality of individual concepts are organized in a particular sequence and organization. Therefore, it is the structure and sequence of an individual concept in a particular manner which defines an idea or a “conceptual flow.” The details of this conceptual flow will be described in more detail herein below.
  • FIG. 2 there is illustrated a diagrammatic view of the concept universe.
  • Note that “node” and “neuron” will be utilized interchangeably throughout.
  • Each of these concepts has a group of relationships associated therewith. For example, a particular sentence could be utilized in a discourse wherein it was stated that “A leopard named Egor is in the cat family and has white fur and wears a red collar.” There are a plurality of relationships that exist within this sentence.
  • When defining a particular grouping, i.e., a sentence, all that is necessary is to define the concepts that are defined in this grouping, a sentence making a logical grouping in natural language, and then define the relationships between the various concepts. Once the concepts in a grouping, i.e., a sentence, and the relationships between those concepts, are defined, the overall idea or concept of the sentence or logical grouping is defined. This can be further extended to a paragraph, a chapter in an overall book, etc. Again, this will all be described in much more detail herein below. Referring back to FIG. 2 , there are illustrated a plurality of neurons, each defining concepts. There is defined a first grouping 202 and a second grouping 203 , these each being independent groupings. A particular grouping of concepts and their relationships would be associated with, for example, a particular sentence.
  • FIG. 3 there is illustrated a diagrammatic view of a conceptual flow in a particular overall conceptual flow.
  • the following sentence is that which is represented by the conceptual flow of FIG. 3 :
  • the sequence flow would then go between various neurons.
  • the flow of a sentence would be the first relationship between “dog” and “domestic animal” along a first path represented by the circled numeral (1).
  • dog will have a stored relationship to “companions” and companions will have a stored relationship back to “dog.” It is understood that stored in the “dog” neuron is the relationship to other neurons and not the relationship “from” other neurons, i.e., all that is important for the dog neuron is to know that it has an associative relationship to the companion neuron and not to know that the companion neuron has a back relationship to the dog neuron. In a similar manner, the next relationship in the flow will be the relationship of the “dog” neuron to the “canine” neuron, which is one of parent/child.
  • the “dog” neuron indicates that there is a parent relationship to the “canine” neuron and the “canine” neuron will indicate a child relationship back to the “dog” neuron. Thus, the “child” relationship is not stored in the “dog” neuron. There will then be a relationship defined between “canine” and “hairy,” this being an associative relationship. The last relationship will be the instance relationship to the “Fido” neuron. There will be a relationship in one direction to “Fido” that is an instance relationship, and the back relationship from “Fido” to “dog” will be an “instance_of” relationship. These relationships will be described in more detail herein below.
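  • As an illustration only, the forward/back reln bookkeeping just described might be sketched as follows in C++; the names (Neuron, Reln, link) and the NID values are hypothetical and are not taken from the disclosure:

        #include <cstdint>
        #include <unordered_map>
        #include <vector>

        // Hypothetical sketch: a neuron stores only its outgoing relns, and the
        // matching back-reln (e.g. CHILD for PARENT, INST_OF for INST) is stored
        // at the target neuron, as in the dog/canine/Fido example above.
        enum class RelnType { PARENT, CHILD, INST, INST_OF, ASSOC, POSSN, POSSN_OF };

        using NID = std::uint32_t;                    // neuron ID (serial number)

        struct Reln   { RelnType type; NID target; };
        struct Neuron { NID id = 0; std::vector<Reln> relns; };

        std::unordered_map<NID, Neuron> brain;        // the nodal array, keyed by NID

        // Store a reln at 'from' and the matching back-reln at 'to'.
        void link(NID from, RelnType fwd, NID to, RelnType back) {
            brain[from].id = from;
            brain[from].relns.push_back({fwd, to});
            brain[to].id = to;
            brain[to].relns.push_back({back, from});
        }

        int main() {
            enum : NID { DOG = 1, CANINE = 2, FIDO = 3 };            // invented NIDs
            link(DOG, RelnType::PARENT, CANINE, RelnType::CHILD);    // dog is a child of canine
            link(DOG, RelnType::INST,   FIDO,   RelnType::INST_OF);  // Fido is an instance of dog
        }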
  • the nodal array and the overall information stored for the artificial brain is comprised of a plurality of concept neurons that are interconnected together with various relationships. That is to say, one particular neuron may have thousands of potential relationships to other neurons.
  • the “dog” neuron may be related to the “canine” neuron, the “dog body” neuron, the “fur” neuron, etc.
  • standing alone, these relationships mean nothing. It is when the concept flow is disposed in some organized and structural conceptual manner that these concepts give meaning to a particular overall idea that is conveyed by the sequence, structure and organization of the particular grouping and flow of concepts.
  • FIG. 4 there is illustrated a single concept node or neuron 404 that is associated with a plurality of other concept nodes 406 labeled concept node b, concept node c . . . concept node N.
  • Each of the concept neurons or nodes has associated with it a set of properties in a block 408 defining certain aspects of a particular neuron.
  • the property block might include such things as it being thin, having a size less than a particular diameter, being made of wood, fiber, etc. These are certain things that are fixed with respect to the properties thereof.
  • concept neurons can be created with certain properties such as the type of word they are, i.e., noun, verb, etc., and they can have defined properties associated therewith. Alternatively, these concept neurons can be “learned” neurons. There can be text neurons, there can be identity neurons, etc. However, each neuron or node constitutes a particular concept of either a hard object or an abstract idea. It is defined in the system by a neuron ID “NID” which is all that is necessary to refer one neuron to another, i.e., it is a flow of NIDs.
  • the particular neuron 404 is defined as having a plurality of relns to the other neurons or nodes 406 .
  • These relns are useful when querying the knowledge set that includes the nodal array of neurons about particular relationships. For example, it may be desirable to determine information about a dog. The nodal array would go in and determine the closest relationships and then the most distant relationships. For example, it might first determine that dogs are canines and that dogs possess a dog body. It then may go one more step and determine that, since the dog exists in a dog body, it possesses paws and then a further relationship would be that it possesses claws because it possesses paws.
  • the direct relationship would be the immediate relationship to a particular neuron and then a more distant relationship, i.e., removed by one relationship, would be a further relationship.
  • the query could be defined in terms of specific relationships, one example being a “made_of” relationship that defines some neuron that constitutes the makeup of a particular object.
  • a dog body, for example, would be made of paws, a tail and hair.
  • a word can be parsed from a particular sentence that exists within a particular paragraph or discourse and “conceptualized” into the particular neuron, wherein the properties of that neuron may be defined and the relationships between that particular concept and other concept neurons defined.
  • the relationships are defined between various other neurons that exist within the particular discourse. It may be that the discourse results in a new relationship that will be presented. For example, if the discourse indicated that “the dog has a red collar,” the indication would be that there is a POSSN (possession) reln between the “dog” neuron and a “red collar” neuron, which may have previously existed.
  • the first is a simple neuron. This is one wherein a single concept is stored in a single neuron or node, where the neuron is simply a place-holder for that concept.
  • relational connections, “relns,” are provided to other concepts (or words) that give the neuron meaning within the overall context of the brain to provide a knowledge set.
  • These simple neurons store relationships between concepts in a factual manner. For example, the word “dog” forms a “dog” neuron, and that neuron is linked through a reln to other neurons that help to establish the concept of the dog in the system. This may be simply a single neuron or multiple neurons requiring multiple relns.
  • Neuron IDs (NID) for both simple and complex neurons share the same number space.
  • Complex neurons are another type of neuron, which are required in that there are occasionally repeated references to nouns that have a qualifying adjective, such as “black dog.”
  • a complex neuron is created to handle such cases.
  • the complex neuron and its associated concept carries all the implications of “dog” and the qualification of “red,” for example, but without the overhead of its own copy of all the linkages. This is due to the fact that the complex “red dog” concept can be viewed as a single concept, albeit requiring two words to convey that concept in text.
  • the simple neuron refers to a specific instance of “dog” and is connected to “dog” by an INST_OF reln, while “dog” is connected to “that dog” by an INST reln.
  • An additional type of neuron is referred to as the “clump” neuron. This is the concept of gathering “clumps” of information about an action that has taken place and garners with it all the history of that particular action or event. As such, every clump neuron contains a connection to a verb (a distinguishing characteristic). Such clumps are the repository for actions taken by nouns, and each such clump implies what can be viewed as the content of an independent clause, with a variant handling dependent clauses.
  • the brain parses a sentence and outputs a single Conceptual Clump which stores the parsed “thought.”
  • Conceptual Clumps store the thought and not the words, i.e., it is the semantic organization of concepts for that particular sentence, it being understood that a sentence is normally thought of as defining a particular single thought (keeping in mind that some sentences convey multiple thoughts and they have to be dealt with separately).
  • the capability of capturing a diverse collection of input streams or concepts i.e., “concept flows” is provided for.
  • the possibility of analyzing these different streams as being conceptually equal is also accounted for, and providing a diverse range of sentence regeneration for creating an output is also provided for. This will be described in more detail herein below in defining conceptual flows.
  • one Conceptual Clump is a basic sentence clump that, at its most basic level, takes the words and phrases of a sentence and converts them to a series of semantic roles, one of which is the PAS verb, the main verb that assigns most of the other roles of a particular clump.
  • each Conceptual Clump is defined as providing a role. It is both a neuron that has a defined set of concepts associated therewith such that it points to these particular concept neurons with a particular reln defining the relationship of that particular concept with the “clump” (that being a sentence) and also defines the role of the clump. There are multiple roles associated with clumps, however, which will be described and set forth herein below.
  • One Sentence Clump (SC) is defined by the Clump ID (CID) of “9017.” There is also provided an SC “9181” and an SC “8911,” with multiple SCs possibly contained therebetween.
  • the entire group of SCs is organized in what is referred to as a Controller Clump (CC).
  • the Controller Clump is the way to store information at a much higher level.
  • the Controller Clump provides a linear progression of the thought of a sentence. This is typically a logical progression (e.g., want, then realization; expectation, then reality).
  • the Controller Clump is basically the essence of connected thoughts in a particular grouping of ideas defined by sentences. This, again, will be defined in more detail herein below.
  • FIG. 6 there is illustrated an alternate diagrammatic view of how a sequence of ideas can be defined.
  • the sentence describes a particular thought and various sentences in a paragraph define a logical progression of those thoughts which are grouped together in some logical manner.
  • Two sequential paragraphs define two thought groups.
  • there is a first paragraph, Para 1 , which has associated therewith a plurality of sentence clumps, SC 1 , SC 2 , . . . SCN. This is referred to as Controller Clump CC 1 .
  • the second paragraph, Para 2 , has sentence clumps SC 1 , SC 2 , . . . SCM, tied together as a Controller Clump, CC 2 . What this all provides, at an even higher level, is an Outline Clump (OC). This is basically a larger-view picture of where the textual material is heading, i.e., it provides a scope or outline for a topic, e.g., “Where is this book going?” An Outline Clump could summarize a paragraph or it could provide a summarization of chapters in the book (if appropriate).
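  • A minimal sketch, assuming hypothetical type names, of the SC/CC/OC layering described above; only the containment (SCs grouped under a CC per paragraph, CCs ordered by an OC) is taken from the text, and the clump IDs for the second paragraph are invented:

        #include <string>
        #include <vector>

        using CID = unsigned;                         // clump ID

        struct SentenceClump   { CID id; /* roles omitted in this sketch */ };
        struct ControllerClump {                      // one per paragraph (or section)
            CID id;
            std::vector<CID> sentence_clumps;         // SC1, SC2, ... SCN, in order
        };
        struct OutlineClump {                         // larger-view "table of contents"
            CID id;
            std::string topic;                        // e.g. "Where is this book going?"
            std::vector<CID> controller_clumps;       // the CCs, in reading order
        };

        int main() {
            ControllerClump cc1{100, {9017, 9181, 8911}};   // SC CIDs from the example above
            ControllerClump cc2{101, {9201, 9305}};         // invented CIDs for illustration
            OutlineClump    oc {1, "Where is this book going?", {cc1.id, cc2.id}};
            (void)oc;
        }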
  • FIG. 7 there is illustrated the general structure for parsing the English language based upon an input, be it some type of text or verbal input and conceptualizing that. This will be described further herein below. Additionally, the general structure for the system for storing the concept neurons, etc., will also be described herein below. For the purpose of this discussion, it is necessary to determine how concepts can flow in a particular sentence and the structure will provide some benefit.
  • the event or text input is provided at a block 702 and this is input to a tokenizer 704 .
  • the tokenizer is basically operable to convert the text into a list of sentence tokens and/or NIDs.
  • the tokenizer is an input to a parse block 706 to parse a sentence or fragments thereof into a set of alternative lists of tokens, with weighting alternatives.
  • This is then output to a conceptualize block 708 that is operable to determine the concept behind particular words as they exist within a sentence.
  • This passes to a resolve block 710 to resolve the NIDs from a generic class to find the specific NID referred to by this particular word as it exists within the sentence.
  • a clump is created at a block 712 which assigns roles to groups of NIDs to form both the group of NIDs and the role associated therewith.
  • This block then flows to a block 714 to create the relns from the clump. This is an attempt to convert the clumps to simpler form as relns with associated NIDs. This will then provide a completion of the operation, at a block 716 .
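  • The block 702-716 flow can be summarized with the following illustrative C++ sketch; the stage functions are trivial stand-ins for the tokenizer, parser, conceptualizer, resolver and clump/reln builders, and only the order of operations reflects FIG. 7:

        #include <string>
        #include <utility>
        #include <vector>

        // Hypothetical sketch of the FIG. 7 flow (blocks 702-716).  Each stage is a
        // trivial stub; only the sequencing is taken from the text.
        using NID = unsigned;
        struct Token { std::string text; NID nid = 0; };
        struct Clump { std::vector<std::pair<int /*role*/, NID>> roles; };

        std::vector<Token> tokenize(const std::string&)                   { return {}; } // block 704
        std::vector<std::vector<Token>> parse(const std::vector<Token>&)  { return {}; } // block 706
        std::vector<NID> conceptualize(const std::vector<std::vector<Token>>&) { return {}; } // block 708
        std::vector<NID> resolve(const std::vector<NID>& c)               { return c; }  // block 710
        Clump createClump(const std::vector<NID>&)                        { return {}; } // block 712
        void createRelnsFromClump(const Clump&)                           {}             // block 714

        void processInput(const std::string& text) {                      // block 702
            auto tokens   = tokenize(text);
            auto parses   = parse(tokens);
            auto concepts = conceptualize(parses);
            auto resolved = resolve(concepts);
            auto clump    = createClump(resolved);
            createRelnsFromClump(clump);                                   // block 716: done
        }

        int main() { processInput("The rabbit may quickly jump over the carrot."); }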
  • the function of the conceptualizer 708 is to process the parser output, creating new neurons when necessary and storing conceptual associations derived from the incoming text.
  • This is a two stage process.
  • the first stage is organizational, in which the parser stack output is deposited into a structure to facilitate the creation of relational linkages. From this structure, the information is processed to create relational linkages between concepts.
  • the conceptualizer 708 receives the stack output from the sentence parser, and places each word into a structure derived from the sentence structure of the English language.
  • the structure is an array of clauses, and the first clause is the primary clause of the sentence. Each clause is structured to contain all the possible elements of a clause.
  • Noun phrases consist of a group of up to ten nouns that collectively are the subject, direct object, indirect object, object of the preposition, or part of a complement.
  • Each of the nouns can have up to ten adjectives attached, and may have a preposition chain attached that is of unlimited length.
  • Each noun may also have a relative adjective clause attached.
  • Verb phrases consist of a group of up to ten verbs that contain the main verbs of the clause.
  • the predicate of the clause is the verb phrase.
  • Each verb may have up to ten adverb modifiers.
  • the verb phrase may have one preposition chain, which is attached to the first verb of the verb phrase.
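  • A minimal sketch, with hypothetical names, of the clause structure just described; the fixed capacities only mirror the stated limits (up to ten nouns per noun phrase, up to ten adjectives per noun, an unlimited preposition chain, up to ten verbs with up to ten adverbs each, one preposition chain attached to the first verb):

        #include <string>
        #include <vector>

        struct PrepChain { std::vector<std::string> links; };   // unlimited length

        struct NounEntry {
            std::string noun;
            std::vector<std::string> adjectives;                // up to ten adjectives
            PrepChain preps;                                    // optional preposition chain
            struct Clause* relative_adj_clause = nullptr;       // optional Radj clause
        };

        struct VerbEntry {
            std::string verb;
            std::vector<std::string> adverbs;                   // up to ten adverb modifiers
        };

        struct Clause {
            std::vector<NounEntry> subject, direct_object, indirect_object,
                                   object_of_preposition, complement;   // up to ten nouns each
            std::vector<VerbEntry> verb_phrase;                 // up to ten verbs (the predicate)
            PrepChain verb_preps;                               // attached to the first verb
        };

        // The conceptualizer's first-stage structure: an array of clauses,
        // with clauses[0] being the primary clause of the sentence.
        using SentenceStructure = std::vector<Clause>;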
  • the second stage of the conceptualization process is referred to as the “Emit Process” and entails taking the newly organized information and converting it into neurons and relationals.
  • the process is done by identifying different classes of sentences that can all be processed in the same way.
  • the different classes of sentences are identified based on the structure and the content of the sentence. Different elements within the clause structure are tested to see if they meet certain conditions.
  • the nouns and verbs used will often hold clues that will indicate the conceptual relationships being expressed by the sentence.
  • the nouns and verbs of each sentence are preprocessed to derive as much useful information as may be necessary. This information is then made available for easy access when needed.
  • the information stored for each word is derived from the noun or verb info block.
  • the types of easy access information available are:
  • the parser is a natural language parser which can utilize many different types of approaches.
  • the type that is used in the present embodiment is a predicate-argument parser, based on a predicate argument structure (PAS), using a Backus-Naur Form (BNF) layout description for the language.
  • the tokenizer block 704 is the operation wherein incoming sentences are broken down into “tokens,” items corresponding to individual words and punctuation, but in which the words have already been identified by some form of internal serial number, if they were already known. Therefore, this simplified list (non-text) is then forwarded to the parser for subsequent processing.
  • the parse block 706 is the device that breaks the sentences into its simpler parts, wherein the parser is defined by a top-down description of the language.
  • a Backus-Naur Form (BNF) description of the English grammar is utilized by a natural language C++ parser to translate English grammar into a computer-usable form.
  • the tokenizer initially breaks the sentence into items corresponding to individual words and punctuation and then the parser breaks the sentences down into their simpler parts and the constituent parts are saved in some manner. This is, in general, a “semantic” parser, and is driven largely by the grammar and syntax of valid sentences.
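  • A minimal, illustrative tokenizer in the spirit of block 704 might look as follows; it splits text into word and punctuation tokens and attaches an internal serial number (NID) when the word is already known (the lexicon map and all names here are hypothetical, not the patent's implementation):

        #include <cctype>
        #include <string>
        #include <unordered_map>
        #include <vector>

        struct Token { std::string text; unsigned nid = 0; };   // nid 0 = unknown word

        std::vector<Token> tokenize(const std::string& text,
                                    const std::unordered_map<std::string, unsigned>& lexicon) {
            std::vector<Token> out;
            std::string word;
            auto flush = [&] {
                if (word.empty()) return;
                auto it = lexicon.find(word);                    // known words get their NID
                out.push_back({word, it != lexicon.end() ? it->second : 0u});
                word.clear();
            };
            for (char c : text) {
                if (std::isalnum(static_cast<unsigned char>(c)) || c == '\'') {
                    word += c;                                   // build up the current word
                } else {
                    flush();                                     // word boundary reached
                    if (std::ispunct(static_cast<unsigned char>(c)))
                        out.push_back({std::string(1, c), 0u});  // punctuation is its own token
                }
            }
            flush();
            return out;
        }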
  • a “clump” as defined herein is a form of a neuron that remembers something that relates to action, aspect, or is only true for a specific time, such as “tomorrow.”
  • Those phrases or clumps that simply define something, such as “all birds fly,” are more simply stored as relationships between simple neurons. A large class of information can be stored in this manner, although storing in neurons (not clump neurons) becomes impractical in many cases.
  • all sentences are formed into clumps; subsequent processes determine if they should be converted to neuron relns and deleted, or be permitted to remain.
  • the main operations of conceptualization are:
  • the parser builds many useful lists of data that include a history of recent content in a context pool. A context pool is comprised of relns recently used, i.e., the history thereof, and may or may not have a temporal aspect thereto, i.e., the relns will decay after a period of time if they have not reoccurred in the discourse, so as to be reset.
  • This provides some knowledge of the recent history of the discourse.
  • some of the utilities are called on by the conceptualize block 708 in order to resolve classes (general families) of neurons into specific neurons relating to the current topic.
  • Some clumps can be reused (shared) by several other clumps, as can sentence fragments that occur multiple times.
  • a clump that was converted to relns may be deleted or may be retained for further use by the control or outline clump. After the conversion process, the parse-to-clump process is concluded.
  • Connections between normal neurons in the brain are made through “relationals,” or “relns” for short.
  • Relns for other types of neurons (e.g., clump or identity neurons) are special and are given other names, such as Roles for clump neurons.
  • Each type of neuron has its own numbering sequence (or number space) for enumerating their relational connections.
  • each reln has an 8-bit field in the Most Significant Bits (MSBs) that specifies the type of reln, this being the Cmd field.
  • the 8-bit Cmd field may be divided into separate Opc and Percentage sub-fields, i.e., a 3-bit Opc field and a 5-bit Percentage field.
  • the Adhoc flag defines the bit as a temporary or a real reln, as will be described herein below.
  • the 23-bit neuron field comprises the serial number therefor of the neuron or of the clump.
  • the Cmd field for non-weighted neuron relns utilizes the entire 8-bit field such that a maximum of 255 relns can be defined. This, of course, is up to the system designer as to how relns are defined. Appendix A1 provides for some relns and their respective definitions, it being noted that this is not an exhaustive list and that additional relns are utilized or can be defined.
  • for weighted relns, the 5-bit scaled unsigned percentage field that constitutes the five Least Significant Bits (LSBs) of the Cmd field is used. This leaves a very limited number of command opcodes, limited to 5-6 such relns in total.
  • each such reln diminishes the available number of unweighted relns by 32, down from the total of 255 for the non-weighted relns.
  • this can be remedied. It should be understood that the overall length of the reln could be increased to expand the addressing space.
  • each count amounts to roughly 3.3%, going from a value of 0% for the bit value of “0” to a value of 100% for a bit value of “31.” Unless otherwise specified by the source data creating a linkage, the weight is set to approximately 50% by default.
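  • As a sketch only, the reln packing described above (an 8-bit Cmd field in the MSBs, a 1-bit Adhoc flag and a 23-bit neuron serial number, with the Cmd field optionally split into a 3-bit Opc and a 5-bit percentage) could be encoded as follows; the exact bit positions are this reading's assumption, not a layout given verbatim in the text:

        #include <cstdint>

        using Reln = std::uint32_t;

        // Pack a reln: Cmd in bits 31..24, Adhoc flag in bit 23, NID in bits 22..0.
        constexpr Reln makeReln(std::uint8_t cmd, bool adhoc, std::uint32_t nid) {
            return (Reln(cmd) << 24) | (Reln(adhoc) << 23) | (nid & 0x7FFFFF);
        }

        // For weighted relns the 8-bit Cmd splits into a 3-bit Opc and a 5-bit
        // scaled percentage (0 -> 0%, 31 -> 100%, roughly 3.3% per count).
        constexpr std::uint8_t weightedCmd(std::uint8_t opc, double percent) {
            return std::uint8_t((opc & 0x7) << 5) |
                   std::uint8_t(percent * 31.0 / 100.0 + 0.5);
        }

        constexpr double weightOf(Reln r) {
            return ((r >> 24) & 0x1F) * 100.0 / 31.0;   // 5 LSBs of the Cmd field
        }

        int main() {
            // An illustrative weighted reln pointing at NID 2031 (the Cat neuron in the
            // example below) at the default weight of roughly 50%; the opcode 2 is invented.
            constexpr Reln r = makeReln(weightedCmd(2, 50.0), /*adhoc=*/false, /*nid=*/2031);
            static_assert((r & 0x7FFFFF) == 2031, "neuron field holds the serial number");
            static_assert(weightOf(r) > 45.0 && weightOf(r) < 55.0, "default weight is about 50%");
        }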
  • the various Op codes associated with the weighted neurons for one example of the system is set forth in Appendix A1.
  • FIG. 9 there is illustrated a diagrammatic view of a particular set of relationships between neurons. This represents the state of this particular set of neurons and their relationships at a given time. The relationships have already been created and the neurons are “fixed” for this particular example. As such, this is representative of a static state of the system at a particular moment in time. It should be understood that during receipt of some type of input in the form of a discourse, for example, additional concepts can be introduced or, additional relationships between concepts can be introduced. As noted herein above, for example, there could be a discourse that indicated that a leopard, associated with a particular conceptual neuron, had a leash.
  • the cat neuron has multiple direct relationships, i.e., once removed therefrom.
  • the cat neuron has a parent/child relationship with a Feline neuron 904 such that there is a PARENT reln stored at the Cat neuron 902 pointing to the Feline neuron 904 and a CHILD reln stored at the Feline neuron 904 pointing to the Cat neuron 902 .
  • the Cat neuron 902 has a POSSN reln pointing to a Cat Body neuron 906 with a back-reln POSSN_OF pointing back to the Cat neuron 902 from the Cat Body neuron 906 .
  • the Garfield neuron 908 is merely the concept of a particular instance; examination of its properties will allow a system to determine the textual word associated therewith.
  • the neuron itself is merely the concept of a particular instance of a cat that has associated therewith the various relns and the such.
  • the Garfield neuron 908 has a relationship with an object neuron 910 associated with a red collar, this being Red Collar neuron 910 .
  • This is a POSSN reln pointing from the neuron 908 to the neuron 910 with a back-reln POSSN_OF pointing back thereto from the Red Collar neuron 910 .
  • the Garfield neuron 908 has a relationship with a neuron 912 being a possession relationship such that a POSSN reln exists at neuron 908 pointing to neuron 912 .
  • Neuron 912 is that associated with an object, a white striped tail. Thus, this is a White Striped Tail neuron 912 .
  • the Garfield neuron 908 merely provides an instance of the Cat neuron 902 such that a potential query of “What bears the name Garfield?” will result in a search that will find the Garfield neuron 908 and allow the back-reln to direct the search to the Cat neuron 902 .
  • the Cat Body neuron 906 has a plurality of constituent parts, these being neurons 914 , 916 and 918 associated with paws, pointy ears and a tail, respectively. Each of these has a MADE_OF reln associated therewith, and each of these has a back-reln PART_OF back to the Cat Body neuron 906 .
  • the paws neuron 914 has a MADE_OF reln to a neuron 920 associated with claws, with a back-reln PART_OF to neuron 914 .
  • As cat moves to higher level concepts, the Feline neuron 904 , etc., a different higher level hierarchy is reached. However, the relns are not associated with hierarchy; this is merely embedded within the relns themselves.
  • the Feline neuron 904 is related to a higher level neuron, the Animal neuron 924 , with a PARENT reln stored at the Feline neuron 904 pointing to the Animal neuron 924 and a CHILD reln stored at the Animal neuron 924 pointing to the Feline neuron 904 . Understand that the reln stored at any given neuron defines the type of reln it is, and then the NID associated with that reln defines the particular neuron to which it is related in the defined manner.
  • the Animal neuron 924 can have a PARENT/CHILD relationship with respect to a Skunk neuron 926 (it being noted that there are probably different levels of PARENT/CHILD relationships therebetween but this has been reduced for simplicity purposes).
  • the Skunk neuron 926 has an INST reln to a neuron 928 associated with the name Pepi. This is illustrated to show that neuron 926 also has associated therewith a POSSN relationship with respect to the neuron 912 for the white striped tail.
  • both the Garfield neuron 908 and the Pepi neuron 928 have a relationship to the same white striped tail.
  • an ASSOC reln exists between the White Striped Tail neuron 912 and the Tail neuron 918 . This is a bidirectional reln showing that there is an association between the two.
  • these relationships allow one to gain a significant amount of information about a particular concept. For example, suppose that a query were made to this particular nodal array inquiring the following “Tell me about a cat?” and including the search criteria that it include PARENT/CHILD relationships, all instances of the cat and what the cat was made of, at least to one degree. This would return information such as the following “A cat is a member of the feline category and it possesses a cat body. The cat body is made of paws, pointy ears and a tail.
  • One particular cat is that named Garfield.” As one can see, this required nothing more than to determine the sequence of concepts centering at the Cat neuron 902 and following the relationships outward therefrom to the other concepts by only a single reln step, i.e., one degree of search.
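  • An illustrative one-degree query in the spirit of the “Tell me about a cat” example is sketched below: starting at a neuron, only the requested reln types are followed one step outward. The type and function names are hypothetical stand-ins, not the patent's code:

        #include <cstdint>
        #include <string>
        #include <unordered_map>
        #include <vector>

        enum class RelnType { PARENT, CHILD, INST, INST_OF, POSSN, POSSN_OF, MADE_OF, PART_OF };

        struct Reln   { RelnType type; std::uint32_t target; };
        struct Neuron { std::string text; std::vector<Reln> relns; };

        // Follow only the requested reln types one degree outward from 'start'.
        std::vector<std::uint32_t> queryOneDegree(
                const std::unordered_map<std::uint32_t, Neuron>& brain,
                std::uint32_t start,
                const std::vector<RelnType>& wanted) {
            std::vector<std::uint32_t> hits;
            auto it = brain.find(start);
            if (it == brain.end()) return hits;
            for (const Reln& r : it->second.relns)
                for (RelnType w : wanted)
                    if (r.type == w) { hits.push_back(r.target); break; }
            return hits;   // e.g. for the Cat neuron: Feline, Cat Body, Garfield
        }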
  • FIG. 10 there is illustrated a more simplified diagram illustrating the diagram up to at least the Feline neuron 904 .
  • Each of the neurons illustrated in FIG. 10 , leaving out the Skunk neuron 926 and the Pepi neuron 928 in addition to the Tail neuron 918 , is provided with a neuron number. As illustrated, each neuron number is underscored to distinguish it from a reference numeral.
  • the Cat Body neuron number is “3011,” the Cat neuron 902 has an NID “2031” and the Feline neuron 904 has an NID “1003.” Therefore, the inquiry would be “Tell me everything about a cat.” This would be translated to the NID “2031.” This would then take off with a response “A ‘2031’ is in the category ‘1003’ and possesses a ‘3011.’
  • the ‘3011’ is made up of a ‘3005’ and a ‘3101.’
  • There is a ‘2031’ that exists that is named ‘7041.’” All that is required is to utilize the properties portion of the particular neuron to determine the text associated with that particular neuron to complete the query.
  • FIG. 11 there is illustrated a diagrammatic view of how the various neurons illustrated in FIG. 10 are related to each other.
  • Each of the concept neurons has associated therewith, as described herein above, an NID.
  • Also associated therewith, in addition to the properties aspect (not shown) are the relns.
  • Each of the relns constitutes the type of reln it is in addition to the NID of the concept neuron to which it points.
  • For the Cat neuron 902 , there is assigned an NID of “2031” thereto. This has a POSSN reln associated therewith which points to the neuron “3011,” which is the NID of the Cat Body neuron 906 .
  • the Cat Neuron NID “2031” has associated therewith an INST reln pointing to NID “7041.” This is the NID of the Garfield neuron 908 . Since there is a back-reln associated with each of these, the Cat Body neuron 906 has a POSSN_OF reln pointing to neuron “2031,” the Cat neuron 902 . Similarly, the Garfield neuron NID “7041,” has an INST_OF reln pointing to NID “2031,” the NID of the Cat neuron 902 . Similarly, all of the other neurons have the various relns pointing to different neurons. By looking at the relationships, various queries can be answered.
  • the question might be “What type of animal might have a red collar?” This would merely require the system to go to the neuron 910 and look at the back-reln to go all the way back to a neuron having a PARENT/CHILD relationship that led back to the Animal neuron 924 . This will result in the neuron 902 . This can quickly be done through the use of the relational interconnections and then merely look up the text associated with the neuron 902 and provide the answer, “This is a cat.”
  • Another query could be “What does Garfield possess?” This would merely require going to the neuron 908 and going back to any neuron that would have the closest MADE_OF reln associated therewith. This would come back with the answer that it possessed “a cat body with paws, claws, pointy ears and a tail.”
  • There are different types of relns. Some are parent-like, some are child-like and some are sibling-like.
  • the parent-like relns are relns such as PARENT, PART_OF, INST_OF and POSSN_OF. This means they point back to a higher hierarchical order.
  • the child-like relns would be those such as CHILD, INST, MADE_OF and ATTRIB.
  • the sibling-like relns would be those such as ASSOC and NAT_ATC.
  • The concept of “Clumps” is defined as a neuron that contains a reference to a verb (a requirement) and that takes a plurality of concepts, be it words or phrases of a sentence or otherwise, and converts them to a series of semantic roles. As such, it is capable of capturing the knowledge embodied within an independent clause. With respect to clumps of sentences, this is again geared by the role of those sentences from a semantic standpoint. It should be understood that the term “relational interconnections” when used with concept neurons defines a relationship between one neuron and another neuron. In a similar manner, a “role” is still a relationship between a clump neuron and all of the concept neurons that make up the clump. The role defines the relationship of each of these neurons in the clump or each of the sentences within a clump. To understand the concept of clumps, it is first necessary to determine how clumps are formed and such discussion will follow.
  • Central to any sentence or thought is the main verb; it is the driver of the sentence. For any given sentence, selecting a different main verb from that sentence can lead to a drastically different meaning.
  • the main verb assigns semantic “roles” or “responsibilities” to the various grammatical constituents, and when that verb changes, the entire sentence changes.
  • the unique verb can occur in a certain manner, at a particular time; it can carry a theme, and there can be a main agent or something that experiences the verb's action. Modifiers such as roles, experiences and locations enable the transfer of words to concepts. The words themselves are not stored; the concepts behind the words are.
  • the PAS consists of multiple different roles that can be assigned to any given verb. Examples of these roles are stated as follows:
  • the various roles can be assigned at different levels. At the sentence levels, there are applied “sentence roles.” These are in addition to the Semantic Roles assigned by the verb. At the sentence level, the Sentence Roles are frequently used to connect thoughts. Correlate pairs, such as “if-then” will establish a unique relationship between sentences. Adverbial conjunctions such as “however” denote contrast to a previous statement. These play a critical role in understanding relationships between thoughts.
  • FIG. 12 there is illustrated a diagrammatic view of the parser process for taking an input from tokenized text to the clumps.
  • This is referred to as the “Natural Language Parser.”
  • the parsing of natural language text is one of the most difficult challenges in computing. The subtleties, nuances, innuendos, idioms, dynamic nature and ambiguities are but a portion of the quest to accurately break down natural text into its relevant and intended meaning. Behind every text is a conceptual thought, as described hereinabove. The parser discerns the thought. Through breaking down sentences into their conceptual parts, analyzing, topic tracking, and an awareness of context, previously daunting sentences are handled. Additionally, in parsing any language one is faced with multi-tiered layers of complexity.
  • the tokenization is initiated at a block 1202 to start at the most basic level.
  • the target text must be prepped prior to attempting to parse it.
  • a tokenizing process is applied to break the text into basic groupings. In language, these are often called words, i.e., the logical semantic element. These words do not have to be official words, as they could be an unknown collection of letters or a sequence of symbols. These “tokens” are the input that drives the parsing process.
  • the next step in the process is associated with the Pre-Rules Layer, as indicated by a block 1204 .
  • the Pre-Rules layer marks all the relevant tokens with flags that clue the later grammatical parser.
  • the Pre-Rules need only be executed or run one time. They are not changed due to different verb attempts, because they hold true no matter what the main verb ends up being.
  • the next step is the PAS Verb Selection at block 1206 .
  • the Predicate Argument Structure (PAS), or the main verb, is selected through a scoring system.
  • the scoring system determines which possible verbs to try. Regardless of success, other options will also be selected and tried due to the recursive nature of the parser.
  • the PAS Verb selected is the main verb. Going forward, the parser assumes this to be true and proceeds as if it were so. This enables the parser to avoid the complexities of constantly attempting to resolve the issue during the grammatical parse.
  • Post-Rules Layer is applied to the input tokens according to the assumed selected PAS Verb. In English, there are rules that can be applied once the verb is discerned. Since the parser assumes the main verb, in any given parse, the main verb has been discerned.
  • the next step in the process will be the grammatical parse, as represented by block 1210 .
  • This is a recursive process.
  • When parsing text, there are many “decisions” that have to be made. Many words can operate as multiple word types. Improper grammar and punctuation are often used, and that cannot prevent the parser from its task. “Decision Nodes” have been implemented that track these decisions through the course of the parse. An example of a decision node is the following:
  • a decision point occurs after the main verb “claimed.”
  • the PAS data for the verb “claimed” indicates that “claimed” assigns a role of “theme.”
  • This theme represents the verb “claim.”
  • the entire role itself can be a nested clause with its own PAS verb.
  • the grammatical parser cannot be certain whether a nested clause exists, whether “that” is a relative pronoun, whether it is an irrelevant keyword, or whether it is a determiner.
  • a nested clause is referred to by linguists as a “CP” or complementizer phrase. Complementizers can have their heads, or words that will lead them off, or they can be assumed. These cases would look as follows:
  • the decision node stores an enumerated set of information regarding the decision. Nodes are loaded with the realm of possibility. Decision logic determines which possibility to choose and it records that choice in a log. Some nodes lead to ambiguity, while others do not. Upon failure or success of any given parse, all ambiguous nodes will be chased. Essentially, the other choices are made and the parser attempts to parse that particular version. In handling decisions in this manner, the parser's hands are clean. There is really no decision, because all decisions that lead to a valid parse are valid and acceptable at this stage.
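  • A minimal sketch, under the assumption of hypothetical names, of the decision-node idea: each node records the realm of possibilities and the choice taken, and on failure (or success) the remaining alternatives are chased by backtracking to the most recent still-ambiguous node:

        #include <cstddef>
        #include <string>
        #include <vector>

        struct DecisionNode {
            std::string description;                 // e.g. the word type of "that"
            std::vector<std::string> possibilities;  // the realm of possibility
            std::size_t chosen = 0;                  // index of the choice made
            bool ambiguous() const { return chosen + 1 < possibilities.size(); }
        };

        // Advance the most recent still-ambiguous node to its next alternative;
        // returns false when every combination has already been tried.
        bool chaseNextAlternative(std::vector<DecisionNode>& log) {
            while (!log.empty()) {
                if (log.back().ambiguous()) { ++log.back().chosen; return true; }
                log.pop_back();                       // exhausted: backtrack further
            }
            return false;
        }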
  • the next step in the process is determining the role resolution.
  • During role resolution, the grammatical roles are converted to their PAS Role counterparts.
  • a subject may become an actor, an experiencer, etc.
  • the PAS verbs are defined through the use of various rules that are set subject to linguistic guidelines. These roles must then be scored, this scoring viewed as a competition. This is facilitated at a block 1212 .
  • the valid grammatical parse options are the competitors vying for the parse. There are multiple layers upon which the competition can be judged, as follows:
  • Conceptual Clumps, as described herein above, are a collection of concepts, wherein words are used to convey concepts. This collection of concepts comes together to form a thought.
  • the output of the parser is a single clump at a block 1214 that is neatly stored in its conceptual form.
  • If the parse is not acceptable, a fragment is returned, a desperation attempt is made, etc. This is illustrated by blocks 1216 , 1218 and 1220 , each taking a separate path back to the initial tokenized block 1202 to pass through the process one additional time.
  • ambiguity is resolved at an ambiguity node 1222 . If the grammatical parse has been acceptable, it will pass to the ambiguity node 1222 . Once any ambiguities are cured, this will pass back to the PAS Verb selection block 1206 and then to the scoring block 1212 .
  • the ambiguity nodes block 1222 determines if there are ambiguities and, if so, it will pass as to the last ambiguity node and then back to the grammatical parse. If the grammatical parse fails, it will pass back to the PAS Verb selection block also.
  • Lexical ambiguity occurs when one word can mean different things, i.e., the homograph can technically be interpreted differently. Words like bank, jump, chair, or cup all have multiple meanings and uses. An example of such is, “American chair seeks arms.” There are actually two lexical ambiguities here. The first: is “chair” of the American variety, or is “chair” something that is American, e.g., a leader or head-of-state? The other ambiguity: are these “arms” parts of a body or parts of a chair? In general, lexical ambiguities arise when words can function as two or more parts of speech.
  • the clump basically gathers information about an action that took place and garners with it all the history of that event.
  • Such clumps are repositories for the actions taken by nouns and each such clump implies what can be viewed as the content of an independent clause, with a variant handling dependent clauses.
  • a Conceptual Clump stores the parsed “thought,” and is generally used to hold the verb of the sentence, although it need not be.
  • clump neurons have their own number space, as noted herein above.
  • the clump takes the words and phrases of the sentence and converts them to a series of semantic “roles.”
  • Three types of semantic roles drive the basic sentence clump.
  • the primary is the PAS Verb (ROLE_CLUMP_VERB). It is the main verb that assigns most of the other roles stored in the clump. It is important to note that different PAS verbs will assign different roles. Some verbs are not able to assign certain roles, and many times the roles that are assigned are restricted in various ways. These restrictions provide an aid in scoring a parse, as they will help an accurate sentence reconstruction.
  • the ROLE_TASPECT contains the tense and aspect that the PAS assigning verb utilized.
  • the last driving role at the basic sentence level is captured with one or more of the five modal roles: ROLE_M_INTENT, ROLE_M_ABILITY, ROLE_M_OBLIGATION, ROLE_M_DESIRE and ROLE_M_POSSIBILITY.
  • ROLE_M_POSSIBILITY: a role associated with the “may” NID;
  • ROLE_ACTOR: a role associated with “the rabbit” NID;
  • ROLE_EXPERIENCER: a role associated with “the carrot” NID;
  • ROLE_MANNER: a role associated with the concept “quickly” NID.
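  • A hypothetical sketch of such a basic sentence clump for “The rabbit may quickly jump over the carrot,” using the roles listed above; the NID constants are invented for the example:

        #include <cstdint>
        #include <utility>
        #include <vector>

        enum Role { ROLE_CLUMP_VERB, ROLE_TASPECT, ROLE_M_POSSIBILITY,
                    ROLE_ACTOR, ROLE_EXPERIENCER, ROLE_MANNER };

        struct ClumpNeuron {
            std::uint32_t cid;                                  // clump ID
            std::vector<std::pair<Role, std::uint32_t>> roles;  // role -> NID
        };

        int main() {
            enum : std::uint32_t { JUMP = 501, MAY = 502, RABBIT = 503,
                                   CARROT = 504, QUICKLY = 505 };   // invented NIDs
            ClumpNeuron clump{2001, {
                {ROLE_CLUMP_VERB,    JUMP},      // the PAS verb that assigns the other roles
                {ROLE_M_POSSIBILITY, MAY},       // modal role
                {ROLE_ACTOR,         RABBIT},
                {ROLE_EXPERIENCER,   CARROT},
                {ROLE_MANNER,        QUICKLY},
            }};
            (void)clump;
        }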
  • command field is a 2-bit field located in every role of the clump. Its primary purpose is to allow reusability of clumps. Were clumps to proliferate, they would consume significant memory space over time.
  • a method is implemented to permit reuse of existing clumps, such as sharing a clump that is close to what is needed.
  • a base clump can be shared as a result to save storage space if a new clump is only different by “n” amount of roles (where “n” is determined by a function that decides if it actually saves any time/space through reuse of the clump).
  • An alternative to this, and possibly a preferable one, is to minimize creation of these extra clumps, using linked list structures to hold the would-be content of the clumps. The worth of each such (tree-like) structure branch would then be evaluated to determine which one should be selected to be output as a clump neuron.
  • a single neuron represents a single concept, and connections to other neurons establish its meaning, representation and purpose within the context of the entire database.
  • the clumps utilize “semantic roles” to capture the history of some aspects of the concept within the brain. Establishing these semantic roles for nouns and verbs allows the system to more easily restate things.
  • the allocation of roles (and rules for their use) to words is used as an existing hierarchy of words. With or without them, the system can choose alternative word-renderings for the same concept, but roles make that choice easier.
  • the semantic roles thus act as a “glue” that takes the meaning implied by an existing neuron interconnection—with or without hierarchy—and guides their transformation into sentences where needed. Before proceeding further, it may be helpful to describe the general concept of semantic roles in terms of the PAS.
  • all nouns in a sentence are considered to be arguments to a verb wherein each verb assigns one label, called a semantic role, to each noun.
  • the total of semantic roles for a particular verb is the Predicate Argument Structure, or PAS for that verb.
  • PAS is just one of multiple parsers that could be utilized herein and it is anticipated that other parsers could be utilized.
  • In general, the goal of this system is to separate the semantics, or meaning, from the syntax, or form (such as word order). This system is designed to handle the various ways that one can change word order or even tense in English, but retain the same meaning. There are provided three examples of this word-order change. They are as follows:
  • In addition to the simple clump described in FIG. 13 , there are also provided Nested Sentence Clumps. These are slightly more complex than a basic clump in that this type of clump contains a nested clump functioning as an entire role assigned by the PAS verb. Sentences with relative adjectives (Radjs), nested WH roles, and gerunds are examples of this. For example, consider the sentence, “The rabbit that bit me may quickly jump over the carrot.” This illustrates clump “2001” referring to clump “2002” in the role ROLE_ACTOR.
  • This shows the additional role, ROLE_CLUMP_VERB, as being associated with the word “bit” and the role, ROLE_EXP, as being associated with the word “me.”
  • A relative adjective (Radj) is used to determine the noun. It is helpful to resolve a noun by determining “which one” is being considered. However, if the clump the Radj forms is new information, it should be stored. The system will either resolve a noun or it will have to create a new concept that awaits resolution with a high priority flag.
  • a more accurate way to store the information is with a Controller Clump at a higher level, as described herein above.
  • the assumption of the controller is that it progresses linearly, i.e., it is a linear flow of concepts. Connected clumps are stored in a linear fashion. If one is not explicitly expressed, the system will store them in a logical progression (e.g., want, then realization; expectation, then reality).
  • the controller is the essence of the connected thoughts and this is illustrated in FIG. 15 .
  • the clump “2001” is defined as the Controller Clump. This has two roles, the SENT_SEQ role and the SENT_CONTRAST role. The first role is assigned or points to the resolved NID of clump “2002,” and the second points to the resolved NID of clump “2003.”
  • Clump “2002” has a plurality of roles associated therewith. This is for the words “want,” “Luke,” and “peanuts.”
  • the verb “want” is the main verb for this clump and occupies the role, ROLE_CLUMP_VERB.
  • the word “Luke” takes the role of the actor and the word “peanuts” takes the role of the experiencer.
  • the owner is the Controller Clump, clump “2001.”
  • the clump “2003” is associated with the words “got,” “Luke,” and “cashews.”
  • the main verb in this clump is the verb “got,” the actor is the word “Luke,” and the experiencer is the word “cashews.”
  • the owner again, is clump “2001.” With this arrangement, storing at the higher level will allow for a greater diversity of expression on the output side and also for a strong ability to match concepts in the analyzing process (such as for answering questions).
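  • The FIG. 15 arrangement can be sketched as follows: the Controller Clump “2001” links clump “2002” and clump “2003” through the SENT_SEQ and SENT_CONTRAST roles. The struct names and word NIDs below are invented for illustration:

        #include <cstdint>
        #include <utility>
        #include <vector>

        enum Role { ROLE_CLUMP_VERB, ROLE_ACTOR, ROLE_EXPERIENCER,
                    SENT_SEQ, SENT_CONTRAST };

        struct ClumpNeuron {
            std::uint32_t cid;
            std::uint32_t owner;                                 // owning Controller Clump (0 = none)
            std::vector<std::pair<Role, std::uint32_t>> roles;   // role -> NID or CID
        };

        int main() {
            enum : std::uint32_t { WANT = 601, GOT = 602, LUKE = 603,
                                   PEANUTS = 604, CASHEWS = 605 };   // invented word NIDs
            ClumpNeuron c2002{2002, 2001, {{ROLE_CLUMP_VERB, WANT},
                                           {ROLE_ACTOR, LUKE},
                                           {ROLE_EXPERIENCER, PEANUTS}}};
            ClumpNeuron c2003{2003, 2001, {{ROLE_CLUMP_VERB, GOT},
                                           {ROLE_ACTOR, LUKE},
                                           {ROLE_EXPERIENCER, CASHEWS}}};
            ClumpNeuron controller{2001, 0, {{SENT_SEQ,      c2002.cid},
                                             {SENT_CONTRAST, c2003.cid}}};
            (void)controller;
        }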
  • In FIG. 12 a , an exemplary flow diagram illustrating linking between clumps is shown.
  • the system has various types of neurons of which Experience neurons and normal neurons link to clumps.
  • All other neurons capture knowledge, relationships and facts, but the Experience neuron captures the emotions, timing and expectations of an experience. In this way, this neuron defines the essential concept of an experience.
  • the neuron IDs for experiences have their own numbering space.
  • Experience neurons capture a further level of detail and nuance that comes with first person experience.
  • the core of an experience is a clump; thus it will link to clumps.
  • All normal neurons are representative of a “concept” and, from these concepts, there is linkage to groupings of concepts (the clump). “A dog that jumped a road, ate a bird and then went to sleep” links to a controlling clump representing the entire sequence.
  • a “conceptualizer” logic decides what to do with the clump. It may need to be broken down and stored as class-level properties, or even as a process neuron.
  • Emotional impact of any clump can be stored by adding a ROLE_EMOT reln in the clump.
  • a particular statement of a particular political speech may have incited anger, and that could be stored in the exact offensive sentence level clump.
  • the speech itself may not have been offensive, thus the “political speech” clump would not get the ROLE_EMOT.
  • clumps are a collection of concepts/thoughts, and by interconnecting thoughts with neurons and assigning emotional impact to them, we increase the flexibility of the system and move one step closer to emulating the human brain.
  • the Outline Clump is much like the table of contents for a book. It gives the overall purpose or subject of a discussion and provides a Table-of-Contents-like layout of the material.
  • a single OC, for example, summarizes the contents of a training document the brain has absorbed. Another might outline a movie or an experience such as my "Hawaiian Vacation of 1999."
  • the OC is a holding place for a series of Controller Clumps (CCs) arranged in a hierarchical fashion.
  • sentences are comprised of Semantic Clumps (SCs).
  • Each paragraph, for example, is managed by a Controller Clump that includes a number of SCs and a final summary SC. If one is reading a book, higher-level CCs are arranged to account for sections of a chapter. Above them, an Outline Clump (OC) orders the chapter CCs into an overall book. It is possible to discard all the lower-level semantic clumps (SCs) except for the summary clumps, if desired, to provide a synopsis of the book.
  • Each CC and OC has a summary clump attached to it, regardless of the level in the structure. For example, a “section” CC has below it a summary SC for every paragraph. The section itself has a summary SC that is a “roll-up” of the SCs for all the paragraphs below it. Finally, the Outline Clump (OC) has a summary SC that summarizes what the book itself is about.
  • the OC differs from other types of clumps in that it can wrap up textual or discussion content as well as experiences (the "expers," as discussed hereinabove).
  • the types of knowledge in an Experience neuron are organized quite differently from clumps and conceptual neurons. They include such things as expectations and anticipations (met or not) and various emotional and time stamps that relate to the experience.
  • the OC has the least information of all clumps but contains the highest-level synopses of multiple types of knowledge. It is truly an outline or staging point from which to recall knowledge from many levels.
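  • As a minimal sketch of this hierarchy, the classes below hold an Outline Clump over Controller Clumps over Semantic Clumps, each carrying its summary SC, and show the synopsis roll-up that keeps only the summaries. The class layout is an illustrative assumption, not a prescribed implementation.

        class SC:                                    # Semantic Clump (one sentence)
            def __init__(self, text): self.text = text

        class CC:                                    # Controller Clump (paragraph/section)
            def __init__(self, children, summary):
                self.children, self.summary = children, summary

        class OC(CC):                                # Outline Clump (whole document)
            pass

        para1 = CC([SC("sentence 1"), SC("sentence 2")], summary=SC("gist of paragraph 1"))
        para2 = CC([SC("sentence 3")], summary=SC("gist of paragraph 2"))
        book = OC([CC([para1, para2], summary=SC("gist of the section"))],
                  summary=SC("what the book is about"))

        def synopsis(node):
            """Discard lower-level SCs, keeping only the summary roll-ups."""
            out = [node.summary.text]
            for child in node.children:
                if isinstance(child, CC):
                    out.extend(synopsis(child))
            return out

        print(synopsis(book))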
  • a proposed new clump neuron may be similar to an existing clump neuron ("clump"). It may be more economical to create a MAKE_FROM neuron using the original clump as a pattern or template, rather than creating a full clump that largely replicates an existing item.
  • the process for creating a non-duplicate clump takes the following steps:
  • a “matched” clump is one that has an identical role table set. That is, all of the roles are the same, and the role values are the same (the neuron IDs the roles point to). Alternatively, a partial match can exist wherein some roles match up exactly, but others do not.
  • the system must then determine what threshold will create a new clump vs. reusing an existing one. As an example, consider the phrases “The rabbit bit Thomas,” “The rabbit bit Luke,” and “Skip bit the rabbit.” This is illustrated in FIG. 16 .
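  • By way of a minimal sketch, the comparison of role tables might be scored as below. The dictionary form, the role names and the 0.5 threshold are illustrative assumptions; the description leaves the actual threshold as a design choice.

        def match_score(new_clump, existing_clump):
            """Fraction of roles whose values (NIDs) match exactly between two role tables."""
            matched = [r for r in new_clump if existing_clump.get(r) == new_clump[r]]
            return len(matched) / max(len(new_clump), len(existing_clump))

        existing = {"VERB": "bite", "ACTOR": "rabbit", "EXPERIENCER": "Thomas"}   # "The rabbit bit Thomas."
        candidates = [
            {"VERB": "bite", "ACTOR": "rabbit", "EXPERIENCER": "Luke"},           # "The rabbit bit Luke."
            {"VERB": "bite", "ACTOR": "Skip", "EXPERIENCER": "rabbit"},           # "Skip bit the rabbit."
        ]
        THRESHOLD = 0.5   # assumed value: above it, derive from the existing clump; below it, create anew
        for cand in candidates:
            score = match_score(cand, existing)
            print(score, "partial match - reuse as template" if score >= THRESHOLD else "create new clump")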
  • Non-weighted relns are listed below.
  • the entire 8-bit Cmd field is used in the enumeration.
  • the enumeration value itself is not given because it changes from time to time. It should be understood that these are just representative of the type of information that can be stored in a relational neuron and is not meant to be exhaustive, and that any type of relationship can be so crafted.
  • The entries below give the Reln Cmd Code, its usage, and the usage of the 24 LSBs. R_ASSOC (Association): NID is a neuron associated with this one.
  • the mud neuron may have an R_ASSOC pointing to earth, and earth has an identical one pointing back to mud.
  • This reln is fully symmetric. It acts as an alternative to actively-firing neurons. See R_SPLIT for further information and usage.
  • R_BLOCK (Gamut or other list follows as a block of relns): This reln indicates that a block of data will follow that is to be processed or ignored as a whole. (This replaces the former R_GAMUT reln.)
  • the bits 0 . . . 7 are # elements; bits 8 . . . 15 are block type.
  • the irregular verbs have a present tense, past tense and past participle tense, and the three are laid out as 3 elements of a block of type R_IRR. Each has an R_IRR reln pointing back to the present-tense form.
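  • A minimal sketch of that field layout is given below, packing the element count into bits 0..7 and the block type into bits 8..15 of the 24 LSBs. The R_IRR value used is a placeholder, since the actual enumeration values are not given.

        R_IRR = 0x2A   # placeholder block-type value; the real enumeration changes from time to time

        def pack_block(n_elements, block_type):
            """Bits 0..7 carry the element count, bits 8..15 the block type."""
            return ((block_type & 0xFF) << 8) | (n_elements & 0xFF)

        def unpack_block(payload):
            return payload & 0xFF, (payload >> 8) & 0xFF      # (count, type)

        header = pack_block(3, R_IRR)                         # e.g. the 3-element irregular-verb block
        assert unpack_block(header) == (3, R_IRR)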
  • R_CLUMP (Action, from noun): CID links to a clump neuron. For "The cow jumped over the moon," the cow neuron (the noun actor) would contain an R_CLUMP pointing off to a clump that describes the action.
  • R_VCLUMP (Action, from verb): This differs from R_CLUMP in that an identical clump CID would be pointed to by the jump (verb) neuron to the same clump. That allows access to the information either from the actor or the action side.
  • R_CPLX (Complex): Installed in the parent neuron, this points to a derived child neuron 'complex'. It may be for the purposes of adding an adjective ("blue dog" or "seat belt"), or for creating an instance from a parent class.
  • the NID is the complex derived from this (noun), and the complex has a back-reln of R_CPLX_OF pointing back to the parent.
  • R_CPLX_OF (Complex-of noun): The first reln of a complex neuron, this is the back-reln to the parent of the complex. NID is the noun this complex is derived from.
  • R_EMOT (Attached Emotion): Used to attach emotional content to a neuron (e.g., to give 'warm fuzzies' or concern), this reln directly indicates the emotion to be evoked.
  • the 24 LSBs are composed of a 6-bit Firing level, a 6-bit Count, and then a 12-bit Neuron_Id to a special neuron (of emotion). The Count is used to . . .
  • R_IDENT_INST (Link to an identity instance): The "Jack" neuron has an R_IDENT_INST to each instance of Jack. Each such instance of a normal neuron will have an R_IDENTITY with an Iid pointing off to an Identity neuron.
  • R_IDENTITY (Link to an identity): This reln links the current neuron to a specific identity neuron. The 24 LSBs are an IID pointing to the identity neuron. (The identity neuron is separate from the neuron for a person around which his character and experiences are built. The IID contains certain things specific to all identities, such as name, birthplace, birth-date, and so on.)
  • R_INST (Instance): This reln defines an instance of a class, e.g., Fido versus dog.
  • R_INST_OF (Back-reln of Instance): This is the back-reln to an R_INST. Found in the instance neuron, its NID points back to the parent class or concept neuron. See R_INST for more details.
  • R_MADEOF (Made-of): Cakes are made of flour, eggs, water, milk and several other ingredients.
  • This reln may be used inside a noun neuron to indicate what that noun is made up of.
  • There is an R_MADEOF for each ingredient, with the NID in the 24 LSBs pointing to the constituent neuron.
  • Each of the constituent ingredients in turn has a back-reln of R_PARTOF pointing back to the ‘cake’ (noun) neuron.
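  • A minimal sketch of this forward/back pairing is shown below, storing each neuron's relns as a list of (reln, target) pairs. The dictionary storage and helper name are illustrative assumptions.

        neurons = {"cake": [], "flour": [], "eggs": [], "water": [], "milk": []}

        def add_madeof(whole, part):
            neurons[whole].append(("R_MADEOF", part))    # forward reln in the noun neuron
            neurons[part].append(("R_PARTOF", whole))    # back-reln in the constituent neuron

        for ingredient in ("flour", "eggs", "water", "milk"):
            add_madeof("cake", ingredient)

        # Answer "what is a cake made of?" by following the forward relns.
        print([target for reln, target in neurons["cake"] if reln == "R_MADEOF"])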
  • R_NOT (Negation): Target is an element of this class; Compl is the cdx.
  • R_OP (Object of prep): Specifies the object of the preposition (paired with R_PREP). Prep phrases are attached to a noun only for 'SUCO' (subject-complement) cases, in which there is no time tense associated with the connection.
  • R_PREP: Whenever an R_PREP reln is found, it will be immediately followed by an associated R_OP.
  • R_OP_OF (Back-reln of OP): This is a back-reln for an object-of-the-preposition and points back to the noun that the prep phrase is attached to. See R_OP for details.
  • R_PARTOF (Back-reln, converse of MADEOF): An object is made up of other items, and will have R_MADEOF relns pointing to the items that make it up. Conversely, the parts that make up the item each have an R_PARTOF back-reln that indicates the class of object of which it is a part.
  • R_PROP (Phys Property list): Any noun may have one or more property lists that describe some physical aspect of an object. These may include color, texture, size and the like and are defined in the chapter on Noun Properties.
  • the variable-length lists are recorded in an R_BLOCK list, with the length given in the 8 LSBs, and with the R_PROP enumeration given in bits 8 . . . 15 of the R_BLOCK. R_PROP never stands alone as a reln.
  • the NID is the parent neuron.
  • R_CDX (Percent of Target Firing): This is the means of connecting an adjective to a noun. "Rather red" may imply a red weighted at 30%. The NID points to the adjective.
  • This role includes all time- or date-related phrases, including concurrence.
  • Time may be a prepositional phrase, an adverb phrase, or an appropriate proform.
  • This role includes all durations, such as “The heated discussion lasted for three hours,” and “I've eaten nothing but cookies and coke for a year.”
  • Duration may be a prepositional phrase headed by "for" followed by a period of time, a prepositional phrase headed by "during" followed by a noun which denotes something happening over a period of time, or a complementizer phrase connected with 'while' or 'as'.
  • Reason may be a prepositional phrase beginning with because, a complementizer phrase beginning with the complementizer so, or in order that.
  • Manner may be a prepositional phrase or an adverb phrase.
  • This semantic role denotes the entities which accompanied the subject of the sentence. For example, “I went to the software store with Anne-Marie.” or “Go with God.”
  • the subject of the sentence need not be the Agent, as in “She was attacked with her friend Stacy.”
  • Accompaniment may be a prepositional phrase beginning with alongside or with, or it may be the adverb phrase together.
  • the Agent is the initiator of the action; the Instrument is the thing that the Agent used.
  • Instrument may be a noun phrase or gerund phrase.
  • Actor may be a noun phrase, gerund phrase, or complementizer phrase.
  • Experiencer is the noun which undergoes change or feeling, as in “Jonny feels very sad today,” or “He attacked me without warning!” Note: All state verbs must assign an exp in active cps.
  • the verb “to become” assigns an experiencer based on the following line of reasoning: The verb “to change,” as in “Rudolph changed Nancy into a nicer person,” assigns an Actor (Rudolph) and an Experiencer (Nancy). In “to become,” as in “I became nicer,” the Actor is changing himself to be nicer. In our model, we have called him the Experiencer, though this is arbitrary and he just as easily could be the Actor.
  • Experiencer may be a noun phrase, gerund phrase, or complementizer phrase.
  • Themes may be either abstract nouns or gerunds, or complementizer phrases.
  • An Effect may only be assigned if the Effect noun came into existence because of the verb. That is, you must be able to replace the original verb with “effect” (not “affect”), and the sentence must hold true.
  • Effect may be a noun phrase or an adjective due to a state or state verb, as in "I made her angry," where 'her angry' is the effect.
  • Identity was created for, but not limited to, being verbs, as in "I am Hannah." "Hannah" receives the role of the identity, because it refers to the same entity as does the subject. Any noun phrase that is not the subject but is its coreferent is assigned the role of identity. Identity burns an implies reln from the exp.
  • Identity may be a noun phrase, gerund phrase, or complementizer phrase.
  • Measure may be a noun phrase or gerund phrase. This is the only role which can be a NP, but cannot function as the Object of a sentence.
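  • As a minimal sketch of how such roles might be recorded for a single sentence, the role table below hand-assigns the phrases of "I went to the software store with Anne-Marie." The dictionary form and the DESTINATION role name are illustrative assumptions; ACTOR and ACCOMPANIMENT follow the roles described above.

        role_table = {
            "PAS_VERB":      "go",               # main verb that assigns the other roles
            "ACTOR":         "I",                # noun phrase initiating the action
            "DESTINATION":   "software store",   # assumed role name for the "to ..." phrase
            "ACCOMPANIMENT": "Anne-Marie",       # "with ..." prepositional phrase
        }

        def realize(roles):
            """Very rough sentence regeneration from a role table (tense handling omitted)."""
            return (roles["ACTOR"] + " went to the " + roles["DESTINATION"]
                    + " with " + roles["ACCOMPANIMENT"] + ".")

        print(realize(role_table))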

Abstract

A method for parsing the flow of natural human language to convert a flow of machine recognizable language into a conceptual flow includes, first, recognizing the lexical structure; then, a basic semantic grouping is determined for the language flow in the lexical structure. The basic semantic grouping that denotes the main action, occurrence or state of being for the language flow is then determined. The responsibility of the main action, occurrence or state of being for the language flow is then determined within the lexical structure, followed by semantically parsing the lexical structure. Thereafter, any ambiguities in the responsibilities are resolved in a recursive manner by applying a predetermined set of rules thereto.

Description

    CROSS-REFERENCE TO RELATED APPLICATIONS
  • This application is a continuation of U.S. patent application Ser. No. 12/101,904, filed Apr. 11, 2008, now U.S. Pat. No. 8,725,493, issued May 13, 2014, entitled NATURAL LANGUAGE PARSING METHOD TO PROVIDE CONCEPTUAL FLOW (Atty. Dkt. No. VISL-28900), which claims benefit of U.S. Provisional Application Ser. No. 60/911,099, filed Apr. 11, 2007 and entitled “NATURAL LANGUAGE PARSING AND THE NEURIC BRAIN MODEL” (Atty. Dkt. No. VISL-28269), U.S. Provisional Application Ser. No. 60/954,247, filed Aug. 6, 2007 and entitled “RELATIONAL INTER-NEURON CONNECTIONS” (Atty. Dkt. No. VISL-28402) and U.S. Provisional Application Ser. No. 61/016,918 filed Dec. 27, 2007, and entitled “DISCUSSION PATENT” (Atty. Dkt. No. VISL-28693) and is a Continuation-In-Part Application of U.S. patent application Ser. No. 11/758,667, filed Jun. 5, 2007, now U.S. Pat. No. 7,925,492, issued Apr. 12, 2011 and entitled “METHOD FOR DETERMINING RELATIONSHIPS THROUGH USE OF AN ORDERED LIST BETWEEN PROCESSING NODES IN AN EMULATED HUMAN BRAIN” (Atty. Dkt. No. VISL-28338), which application claims benefit of U.S. Provisional Patent Application Ser. No. 60/811,299, filed Jun. 5, 2006, and entitled “THE CONCEPT OF GAMUT AND ITS USE” (Atty. Dkt. No. VISL-27690), and is a Continuation-in-Part of pending U.S. patent application Ser. No. 11/697,721, filed Apr. 7, 2007, now U.S. Pat. No. 8,001,067, issued Aug. 16, 2011, and entitled “METHOD FOR SUBSTITUTING AN ELECTRONIC EMULATION OF THE HUMAN BRAIN INTO AN APPLICATION TO REPLACE A HUMAN” (Atty. Dkt. No. VISL-28262), which is a Continuation-in-Part of pending U.S. patent application Ser. No. 11/670,959, filed Feb. 2, 2007, and entitled “METHOD FOR MOVIE ANIMATION” (Atty. Dkt. No. VISL-28177), which is a Continuation-in-Part of pending U.S. patent application Ser. No. 11/425,688, filed Jun. 21, 2006, now U.S. Pat. No. 7,849,034, issued Dec. 7, 2010, and entitled “A METHOD FOR INCLUSION OF PSYCHOLOGICAL TEMPERAMENT IN AN ELECTRONIC EMULATION OF THE HUMAN BRAIN” (Atty. Dkt. No. VISL-27693), which is a Continuation of U.S. application Ser. No. 11/154,313, filed Jun. 16, 2005, and entitled “METHOD FOR INCLUSION OF PSYCHOLOGICAL TEMPERAMENT IN AN ELECTRONIC EMULATION OF THE HUMAN BRAIN”, now U.S. Pat. No. 7,089,218, issued Aug. 8, 2006 (Atty. Dkt. No. VISL-27175), which is a Continuation of abandoned U.S. application Ser. No. 11/030,452, filed Jan. 6, 2005, and entitled “A METHOD FOR INCLUSION OF PSYCHOLOGICAL TEMPERAMENT IN AN ELECTRONIC EMULATION OF THE HUMAN BRAIN” (Atty. Dkt. No. VISL-27019); which claims the benefit of U.S. Provisional Application for Patent Ser. No. 60/534,641, filed Jan. 6, 2004 entitled “A NEURIC BRAIN MODELING SYSTEM IN THE MILITARY ENVIRONMENT” (Atty. Dkt. No. VISL-26569), U.S. Provisional Application for Patent Ser. No. 60/534,492, filed Jan. 6, 2004, entitled “METHOD FOR INCLUSION OF PSYCHOLOGICAL TEMPERAMENT IN AN ELECTRONIC EMULATION OF THE HUMAN BRAIN” (Atty. Dkt. No. VISL-26570), U.S. Provisional Application for Patent Ser. No. 60/534,659, filed Jan. 6, 2004, entitled “DESIGN OF THE NEURIC BRAIN” (Atty. Dkt. No. VISL-26571), all now expired, and which U.S. application Ser. No. 11/697,721, filed Apr. 7, 2007, and entitled “METHOD FOR SUBSTITUTING AN ELECTRONIC EMULATION OF THE HUMAN BRAIN INTO AN APPLICATION TO REPLACE A HUMAN” (Atty. Dkt. No. VISL-28262) also claims the benefit of priority from U.S. Provisional Application for Patent Ser. No. 60/764,442, filed Feb. 2, 2006, and entitled “USE OF THE NEURIC BRAIN MODEL IN MOVIE ANIMATION” (Atty. Dkt. No. 
VISL-27537) and U.S. Provisional Application Ser. No. 60/790,166, filed Apr. 7, 2006, and entitled “BRAIN MODEL” (Atty. Dkt. No. VISL-27620.) All of the above are incorporated herein by reference in their entirety.
  • TECHNICAL FIELD
  • The present invention pertains in general to artificial brains and, more particularly, to the aspect of storing information as a plurality of individual concepts that are capable of being connected to each other through predetermined relationships.
  • BACKGROUND
  • When dealing with a processing system that emulates a human brain, multiple techniques have been used in the past. One is to utilize a first principals system which defines the artificial brain in the form of a plurality of rules and various hierarchical structures. For example, some linguistics systems utilize a database comprised of a plurality of linguistic rules that are used to parse a sentence into its individual parts, such as subject, verb and object and then organize sentences as groups of components into paragraphs and the paragraphs into an overall outline. Some of these systems have rules to determine the concepts behind particular sentences and paragraphs, and even generate an overall outline. A second type of system utilizes a neural network which is comprised of a plurality of weighted nodes that are interconnected with input and output nodes. The various internal nodes, referred to as activation nodes, are trained to provide an overall representation of a system. These are non-linear networks and each node has a function that is determined by training and the such. No one given node has a predefined relationship. Any type of nodal system utilized to represent a series of complex relationships typically requires a plurality of individual node, each defined in terms of activation levels, such that the “firing” of a particular node based upon an input or an event will cause the firing of sequential nodes to represent an overall concept. The use of such nodes does not lend itself to defining concepts in the discrete manner or of interconnecting such concepts in an adequate manner.
  • SUMMARY
  • The present invention disclosed and claimed herein, in one aspect thereof, comprises a method for parsing the flow of natural human language to convert a flow of machine recognizable language into a conceptual flow. First, the lexical structure is recognized and then a basic semantic grouping is determined for the language flow in the lexical structure. The basic semantic grouping that denotes the main action, occurrence or state of being for the language flow is then determined. The responsibility of the main action, occurrence or state of being for the language flow is then determined within the lexical structure, followed by semantically parsing the lexical structure. Thereafter, any ambiguities in the responsibilities are resolved in a recursive manner by applying a predetermined set of rules thereto.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • For a more complete understanding, reference is now made to the following description taken in conjunction with the accompanying Drawings in which:
  • FIG. 1 illustrates an overall block diagram of a general artificial brain and the associated neuron database;
  • FIG. 2 illustrates the concept organization;
  • FIG. 3 illustrates a diagrammatic view of one concept flow;
  • FIG. 4 illustrates a block diagram of a plurality of neurons showing the interconnection thereof through relationships;
  • FIG. 5 illustrates a diagrammatic view of various relationship flows defined as clumps;
  • FIG. 6 illustrates an alternate diagrammatic view of the organization of clumps;
  • FIG. 7 illustrates a block diagram of the general overall artificial brain and the operation of receiving a textual input and providing an output;
  • FIG. 8 illustrates a diagrammatic view of the data fields for a relational neuron;
  • FIG. 9 illustrates a nodal array for one conceptual flow;
  • FIG. 10 illustrates a detail of the nodal array of FIG. 9;
  • FIG. 11 illustrates a diagrammatic view of interconnections between neurons;
  • FIG. 12 illustrates a diagrammatic view of the parser;
  • FIG. 12 a illustrates a diagram of the system linkage to clumps; and
  • FIGS. 13-18 are tables illustrating clump definitions.
  • DETAILED DESCRIPTION
  • Referring now to the drawings, wherein like reference numbers are used herein to designate like elements throughout, the various views and embodiments of a method and apparatus for defining an artificial brain via a plurality of concept nodes connected together through predetermined relationships are illustrated and described, and other possible embodiments are described. The figures are not necessarily drawn to scale, and in some instances the drawings have been exaggerated and/or simplified in places for illustrative purposes only. One of ordinary skill in the art will appreciate the many possible applications and variations based on the following examples of possible embodiments.
  • Referring now to FIG. 1, there is illustrated a general block diagram of the processing system that represents the artificial brain. In general, the brain processing system is referred to by a reference numeral 102. It is operable to receive an input on an input line 104 and provide an output on an output node 106. This input can receive inputs such as speech, text or some sensory input and the output on node 106 can be an action, text or the such. It should be understood that any type of input can be accommodated that provides some sort of information to the system, either static or dynamic. In one implementation of the artificial brain, the system will have a discourse with a user. The user will input text in the form of a query (or input voice that is translated to text or some similar machine recognizable form). For example, the input may be a question that may ask the system to "show me the tree," which is then interpreted by the system to possibly display a picture of a tree. It may be that the system detects some type of ambiguity in that it does not understand whether the query is trying to ask it about a plant or a graph, as the term "tree" could be the plant defined as a tree or it could be a graph that is in the form of a tree structure. In this situation, the output would be a question to the user asking whether the user means the plant or the graph. Another input would then be received indicating that it was the plant. The system would then process this further. However, if the user had indicated the query as "show me the tree that was to be cut down," it is then possible, based upon the level of training in the artificial brain, to determine that there is some type of learned association between the concept of "cut" and the tree as a plant as opposed to the tree as a graph. In that situation, no additional query would be required. Of course, this is a very simple example of the type of input that could be received and the type of output that could be provided.
  • Additionally, in the flow of the discourse, it could be that the context was sufficient to discern that a tree was the plant type as, for example, a previous query could have been directed towards walking in the woods to find a “tree.” This context is maintained in an ongoing contextual list. In this situation, the brain itself can resolve an ambiguity and no clarifying query would be required.
  • The brain processing system 102 interfaces with a nodal array 104 comprised of a plurality of individual neuron nodes, hereinafter referred to as "neurons." The nodal array 104 is operable to store in each of the plurality of neurons a unique concept. This concept can relate to a hard object or an abstract idea. For example, the concept could be the concept of a tree in a plant-based context. It could also be the concept of a "truck." Further, it could be related to the concept of a "fire truck." Any complex number of words could define a particular concept and this particular concept does not necessarily require that it be a single word but, rather, it can be a combination of words and could even include the concept embodied in a phrase.
  • As will be described herein below, each concept has some type of relationship with other concepts, when taken in the context of a sentence. In order to have any type of discourse or to relay a particular overall concept in a discourse, a plurality of individual concepts are organized in a particular sequence and organization. Therefore, it is the structure and sequence of an individual concept in a particular manner which defines an idea or a “conceptual flow.” The details of this conceptual flow will be described in more detail herein below.
  • Referring now to FIG. 2, there is illustrated a diagrammatic view of the concept universe. There are illustrated a plurality of nodes (neurons, noting that “node” and “neuron” will be utilized interchangeably throughout), each representing an individual concept. One could be the concept of a particular verb, one the concept of a particular noun, etc. Each of these concepts has a group of relationships associated therewith. For example, a particular sentence could be utilized in a discourse wherein it was stated that “A leopard named Egor is in the cat family and has white fur and wears a red collar.” There are a plurality of relationships that exist within this sentence. First, if the relationships did not previously exist, there would be a neuron for the concept of leopard created and a relationship generated between leopard to a previously existing neuron defined as the “cat” neuron. This would be created in such a manner wherein the relationship created is defined as being the “parent” relationship between the “leopard” concept and the “cat” concept, i.e., a “parent” relationship. There would also be an “instance of” relationship created between the leopard neuron and the name “Egor.” There would be a “possession” relationship between the “Egor” neuron and a neuron for “red collar.” If the concept “red collar” previously existed as a concept for a neuron, then the relationship would be created in the “Egor” neuron to the “red collar” neuron. There would also be created what is termed a “back relationship” from the “red collar” neuron to the “Egor” neuron. This will be described in more detail herein below when specific relationships are referred to. For the rest of this discussion, relationships between neurons will be called “relns” or a relationship “reln.” An additional relationship would be an attribute which would show that an attribute of the specific “Egor” neuron would be directed to the “white” neuron, assuming this concept embodied in a neuron previously existed.
  • When defining a particular grouping, i.e., a sentence, all that is necessary is to define the concepts that are defined in this grouping, a sentence making a logical grouping in natural language, and then define the relationships between the various concepts. Once the concepts in a grouping, i.e., a sentence, and the relationships between those concepts, are defined, the overall idea or concept of the sentence or logical grouping is defined. This can be further extended to a paragraph, a chapter in an overall book, etc. Again, this will all be described in much more detail herein below. Referring back to FIG. 2, there are illustrated a plurality of neurons, each defining concepts. There is defined a first grouping 202 and a second grouping 203. These are each independent groupings. A particular grouping of concepts and their relationships would be associated with, for example, a particular sentence.
  • Referring now to FIG. 3, there is illustrated a diagrammatic view of a conceptual flow in a particular overall conceptual flow. The following sentence is that which is represented by the conceptual flow of FIG. 3:
      • A dog is a domestic animal. Dogs are companions. Dogs are members of the canine family and they are hairy. A dog I know of is named Fido.
  • The sequence flow would then go between various neurons. In this example, there is the "dog" neuron, the "domestic animal" neuron, the "companions" neuron, the "Fido" neuron, the "canine" neuron and the "hairy" neuron. The flow of a sentence would be the first relationship between "dog" and "domestic animal" along a first path represented by {circle around (1)}. There would be a second flow from "domestic animal" back to "dog", as there is a relationship from "dog" to "domestic animal" which is one of parentage, i.e., "dog" is a member of the "domestic animal" category. The relationship between "domestic animal" and "dog" is one where "dog" is a child of the "domestic animal" category (a back_reln). There will then be the next relationship defined which is between "dog" and "companions." There will be a relationship at the "dog" neuron indicating that the "companions" neuron is an associative relationship, which will have a back relationship with the {circle around (4)} relationship. Thus, dog will have a stored relationship to "companions" and companions will have a stored relationship back to "dog." It is understood that stored in the "dog" neuron is the relationship to other neurons and not the relationship "from" other neurons, i.e., all that is important for the dog neuron is to know that it has an associative relationship to the companion neuron and not to know that the companionship neuron has a back relationship to the dog neuron. In a similar manner, the next relationship in the flow will be the relationship of the "dog" neuron to the "canine" neuron, which is one of parent/child. The "dog" neuron indicates that there is a parent relationship to the "canine" neuron and the "canine" neuron will indicate a child relationship back to the "dog" neuron. Thus, the "child" relationship is not stored in the "dog" neuron. There will then be a relationship defined between "canine" and "hairy," this being an associative relationship. The last relationship will be the instance_of relationship to the "Fido" neuron. There will be a relationship in one direction to "Fido" that is an instance relationship and the back relationship from "Fido" to "dog" will be an "instance_of" relationship. These relationships will be described in more detail herein below.
  • As noted herein above, the nodal array and the overall information stored for the artificial brain is comprised of a plurality of concept neurons that are interconnected together with various relationships. That is to say, one particular neuron may have thousands of potential relationships to other neurons. For example, the “dog” neuron may be related to the “canine” neuron, the “dog body” neuron, the “fur” neuron, etc. However, until some conceptual flow occurs such as that associated with a particular sentence, these relationships mean nothing. It is when the concept flow is disposed in some organized and structural conceptual manner that these concepts give meaning to a particular overall idea that is conveyed by the sequence, structure and organization of the particular grouping and flow of concepts.
  • Referring now to FIG. 4, there is illustrated a single concept node or neuron 404 that is associated with a plurality of other concept nodes 406 labeled concept node b, concept node c . . . concept node N. Each of the concept neurons or nodes has associated with it a set of properties in a block 408 defining certain aspects of a particular neuron. For example, if the concept neuron were the concept of a branch of a tree, the property block might include such things as it being thin, having a size less than a particular diameter, being made of wood, fiber, etc. These are certain things that are fixed with respect to the properties thereof. Although not discussed in the parent applications herein, which were incorporated herein by reference, it should be understood that concept neurons can be created with certain properties such as the type of word they are, i.e., noun, verb, etc., and they can have defined properties associated therewith. Alternatively, these concept neurons can be "learned" neurons. There can be text neurons, there can be identity neurons, etc. However, each neuron or node constitutes a particular concept of either a hard object or an abstract idea. It is defined in the system by a neuron ID "NID" which is all that is necessary to refer one neuron to another, i.e., it is a flow of NIDs.
  • Referring back to FIG. 4, the particular neuron 404 is defined as having a plurality of relns to the other neurons or nodes 406. There can be thousands of these relns which can number as high as 32K relns per neuron. These relns are useful when querying the knowledge set that includes the nodal array of neurons about particular relationships. For example, it may be desirable to determine information about a dog. The nodal array would go in and determine the closest relationships and then the most distant relationships. For example, it might first determine that dogs are canines and that dogs possess a dog body. It then may go one more step and determine that, since the dog exists in a dog body, it possesses paws and then a further relationship would be that it possesses claws because it possesses paws. The direct relationship would be the immediate relationship to a particular neuron and then a more distant relationship, i.e., removed by one relationship, would be a further relationship. The query could be defined in terms of specific relationships, one example being a "made_of" relationship that defines some neuron that constitutes the makeup of a particular object. A dog body, for example, would be made of paws, a tail and hair. One could query what a dog is made of and the database would look to the closest made_of relationships for the purpose of responding to the query. Again, all the database query has to do is look to the relationships to determine how to generate a particular response.
  • Thus, referring back to FIG. 4, all of the potential relationships, which are direct relationships, flow to the concept nodes 406. Each of the concept nodes 406 can then flow to other concept nodes, illustrated as concept nodes 410 with respect to concept node B, and concept nodes 412 are associated with concept node C. Thus, if the relationship desired were that of two steps down from concept node 404, it is possible to determine a relationship between concept node 404 and concept node 410 by allowing the relationship to be taken two steps down. By defining the query in terms of the relationship, the particular relationship path can be defined. Thus, until a query is made or a sentence is constructed, all of the relationships between particular concept nodes are “potential” relationships.
  • From the standpoint of how a neuron is defined, a word can be parsed from a particular sentence that exists within a particular paragraph or discourse and "conceptualized" into the particular neuron, wherein the properties of that neuron may be defined and the relationships between that particular concept and other concept neurons defined. Then, in the conceptualization process, the relationships are defined between various other neurons that exist within the particular discourse. It may be that the discourse results in a new relationship that will be presented. For example, the discourse may indicate that "the dog has a red collar." The indication would be that there is a POSSN (possession) reln between the "dog" neuron and a "red collar" neuron, which may have previously existed. These relationships would result in the "dog" neuron, which already exists, possibly having a POSSN reln pointing to the "red collar" neuron and a POSSN_OF reln stored at the "red collar" neuron pointing back to the "dog" neuron. This is how a relationship would be defined as a result of a particular input discourse, this being described herein below in more detail. In general, as noted herein above multiple times, every unique concept to be known is embodied as a single neuron. While all neurons are essentially identical, the truly important information is not what is stored within them (or their specific characteristic) but how they are interconnected with other neurons. It is these connections that define a concept for what it is, i.e., its unique relationships with other concepts within the system. These relationships can be fixed or they can change. Some can be temporal, i.e., they disappear, or not. As will be described herein below, most of these relns, i.e., the relational connections, are non-weighted, but there are some that are weighted. There are some that are bi-directional connections between two neurons, such as the ASSOC reln, and other concepts require two separate relns, such as the PARENT and CHILD relns. In general, when discussing the general context or flow of concepts, one has to view communications of a concept or a thought. In general, words are used to communicate a concept or a thought. The predominant memory mechanism of human beings is a storage of the concept or thought, not the words by which it was conveyed. There are memory mechanisms that do store exact text, such as memorizing the Declaration of Independence, but this is not addressed herein. The matter of conceptual memory exists across all cultures, because all languages intend to convey something through the use of words. The use of the concept nodes and the relational interconnections is provided to represent overall concepts, ranging from the simplest to the more complex concepts. A listener, in general, is like a parser—a translator of text—trying to arrive at the greater meaning that words in and of themselves attempt to convey. Text comes in through various channels and it is broken down and processed.
  • Concepts are remembered in one of four basic types of neurons. The first is a simple neuron. This is one wherein a single concept is stored in a single neuron or node, where the neuron is simply a place-holder for that concept. As described herein above, relational connections, “relns,” are provided to other concepts (or words) that give the neuron meaning within the overall context of the brain to provide a knowledge set. These simple neurons store relationships between concepts in a factual manner. For example, the word “dog” forms a “dog” neuron, and that neuron is linked through a reln to other neurons that help to establish the concept of the dog in the system. This may be simply a single neuron or multiple neurons requiring multiple relns. Neuron IDs (NID) for both simple and complex neurons share the same number space.
  • Complex neurons are another type of neuron, which are required in that there are occasionally repeated references to nouns that have a qualifying adjective, such as "black dog." A complex neuron is created to handle such cases. The complex neuron and its associated concept carries all the implications of "dog" and the qualification of "black," for example, but without the overhead of its own copy of all the linkages. This is due to the fact that the complex "black dog" concept can be viewed as a single concept, albeit requiring two words to convey that concept in text.
  • By comparison, when referring to a specific dog (such as “that dog”) that has particular traits or associations, a simple concept neuron is created. The simple neuron refers to a specific instance of “dog” and is connected to “dog” by an INST_OF “dog” and from “dog” to “that dog” by an INST reln.
  • An additional type of neuron is referred to as the “clump” neuron. This is the concept of gathering “clumps” of information about an action that has taken place and garners with it all the history of that particular action or event. As such, every clump neuron contains a connection to a verb (a distinguishing characteristic). Such clumps are the repository for actions taken by nouns, and each such clump implies what can be viewed as the content of an independent clause, with a variant handling dependent clauses. In general, the brain parses a sentence and outputs a single Conceptual Clump which stores the parsed “thought.” Conceptual Clumps store the thought and not the words, i.e., it is the semantic organization of concepts for that particular sentence, it being understood that a sentence is normally thought of as defining a particular single thought (keeping in mind that some sentences convey multiple thoughts and they have to be dealt with separately). In storing just the thought or the sequence of concepts, the capability of capturing a diverse collection of input streams or concepts (i.e., “concept flows”) is provided for. The possibility of analyzing these different streams as being conceptually equal is also accounted for, and providing a diverse range of sentence regeneration for creating an output is also provided for. This will be described in more detail herein below in defining conceptual flows. In general, as will further be described herein below, one Conceptual Clump is a basic sentence clump that, at its most basic level, takes the words and phrases of a sentence and converts them to a series of semantic roles, one of which is the PAS verb wherein the main verb that assigns most of the other roles of a particular clump are defined. Further, as will be described herein below, each Conceptual Clump is defined as providing a role. It is both a neuron that has a defined set of concepts associated therewith such that it points to these particular concept neurons with a particular reln defining the relationship of that particular concept with the “clump” (that being a sentence) and also defines the role of the clump. There are multiple roles associated with clumps, however, which will be described and set forth herein below.
  • Referring now to FIG. 5, there is illustrated the idea of how conceptual flows can be represented. There is a type of clump that is called a Sentence Clump (SC) that defines particular neurons associated therewith and the flow thereof. These are illustrated as a linear flow but they are really not related as such. For example, a particular SC could define the various concept neurons as being nouns, verbs, adjectives, objects, etc. However, they are represented linearly just to show they have some relationship within a particular SC. In a particular discourse, which could be sentences in a paragraph, there are illustrated a plurality of SCs. One SC is defined by the Clump ID (CID) of “9017.” There is also provided an SC “9181” and an SC “8911” with multiple SCs possibly contained there between. The entire group of SCs are organized in what is referred to as a Controller Clump (CC). In general, the flow of thought usually ties sentences together with conjunctions, or through punctuation functioning as conjunctions (assuming conjunctions). The Controller Clump is the way to store information at a much higher level. The Controller Clump provides a linear progression of the thought of a sentence. This is typically a logical progression (i.e., Want, then realization, expectation then reality). The Controller Clump is basically the essence of connected thoughts in a particular grouping of ideas defined by sentences. This, again, will be defined in more detail herein below.
  • Referring now to FIG. 6, there is illustrated an alternate diagrammatic view of how a sequence of ideas can be defined. In a discourse, there are typically paragraphs, i.e., logical breaks between discrete sets of ideas. The sentence describes a particular thought and various sentences in a paragraph define a logical progression of those thoughts which are grouped together in some logical manner. Two sequential paragraphs define two thought groups. Thus, there can be illustrated a first paragraph Para1, which has associated therewith a plurality of sentence clumps, SC1, SC2, . . . SCN. This is referred to as Controller Clump, CC1. The second paragraph, Para2, has sentence clumps, SC1, SC2, . . . SCM, tied together as a Controller Clump, CC2. What this all provides, at an even higher level, is an Outline Clump (OC). This is basically a larger-view picture of where the textual material is heading, i.e., it provides a scope or outline for a topic, e.g., “Where is this book going?” An Outline Clump could summarize a paragraph or it could provide a summarization of chapters in the book (if appropriate).
  • Referring now to FIG. 7, there is illustrated the general structure for parsing the English language based upon an input, be it some type of text or verbal input, and conceptualizing that. This will be described further herein below. Additionally, the general structure for the system for storing the concept neurons, etc., will also be described herein below. For the purpose of this discussion, it is necessary to determine how concepts can flow in a particular sentence and the structure will provide some benefit. The event or text input is provided at a block 702 and this is input to a tokenizer 704. The tokenizer is basically operable to convert the text into a list of sentence tokens and/or NIDs. The tokenizer output is provided to a parse block 706 to parse a sentence or fragments thereof into a set of alternative lists of tokens, with weighting alternatives. This is then output to a conceptualize block 708 that is operable to determine the concept behind particular words as they exist within a sentence. This then passes to a resolve block 710 to resolve the NIDs from a generic class to find the specific NID referred to by this particular word as it exists within the sentence. Once completed, a clump is created at a block 712 which assigns roles to groups of NIDs to form both the group of NIDs and the role associated therewith. This block then flows to a block 714 to create the relns from the clump. This is an attempt to convert the clumps to simpler form as relns with associated NIDs. This will then provide a completion of the operation, at a block 716. A rough sketch of this flow is given below.
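  • The stubbed-out fragment below is a minimal sketch of that pipeline; every function body is a placeholder assumption so that only the hand-off from block 702 through block 714 is illustrated, not the actual implementation of any block.

        def tokenize(text):                  # block 704: text -> sentence tokens and/or NIDs
            return text.rstrip(".").split()

        def parse(tokens):                   # block 706: alternative token lists with weights
            return [(tokens, 1.0)]

        def conceptualize(alternatives):     # block 708: pick the concepts behind the words
            tokens, _weight = alternatives[0]
            return tokens

        def resolve(concepts):               # block 710: generic class -> specific NID (stubbed)
            return {c: hash(c) & 0x7FFFFF for c in concepts}     # 23-bit NID space

        def create_clump(nids):              # block 712: assign roles to groups of NIDs
            return {"ROLE_" + word.upper(): nid for word, nid in nids.items()}

        def create_relns(clump):             # block 714: reduce the clump to simpler relns
            return list(clump.items())

        relns = create_relns(create_clump(resolve(conceptualize(parse(
            tokenize("The cow jumped over the moon."))))))
        print(relns)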
  • The function of the conceptualizer 708 is to process the parser output, creating new neurons when necessary and storing conceptual associations derived from the incoming text. In this model, this is a two-stage process. The first stage is organizational, in which the parser stack output is deposited into a structure to facilitate the creation of relational linkages. From this structure, the information is processed to create relational linkages between concepts. As to Organization, in the first stage, the conceptualizer 708 receives the stack output from the sentence parser, and places each word into a structure derived from the sentence structure of the English language. The structure is an array of clauses, and the first clause is the primary clause of the sentence. Each clause is structured to contain all the possible elements of a clause. The major building blocks of a clause are Noun Phrases and Verb Phrases. With respect to Noun phrases, these consist of a group of up to ten nouns that collectively are the subject, direct object, indirect object, object of the preposition, or part of a complement. Each of the nouns can have up to ten adjectives attached, and may have a preposition chain attached that is of unlimited length. Each noun may also have a relative adjective clause attached. With respect to Verb phrases, these consist of a group of up to ten verbs that contains the main verbs of the clause. In the conceptualizer 708 clause structure, the predicate of the clause is the verb phrase. Each verb may have ten adverb modifiers. The verb phrase may have one preposition chain, which is attached to the first verb of the verb phrase.
  • General Structure of a Clause
  • The structure of a clause is divided into 5 main sections.
  • Subject—Noun Phrase
  • Predicate—Verb Phrase
  • Direct Object—Noun Phrase
  • Indirect Object—Noun Phrase
  • Complement
      • Noun Phrase
      • Infinitive Phrase
        Example: “Climates that have little humidity are called arid climates.”
    Clauses:
  • 1st Clause
      • Climates (Subject—Noun Phrase)
        • That have little humidity (Pointer to 2nd Clause)
      • are called (Predicate—Verb Phrase)
      • climates (Direct Object—Noun Phrase)
        • arid (Adjective)
  • 2nd Clause
      • that (Subject—Noun Phrase)
      • have (Predicate—Verb Phrase)
      • humidity (Direct Object—Noun Phrase)
        • little (adjective)
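  • A minimal sketch of that clause array for the example sentence is given below; the dictionary layout and key names are illustrative assumptions standing in for the conceptualizer's internal structure.

        # Clause array for "Climates that have little humidity are called arid climates."
        clause_2 = {                                   # relative adjective clause
            "subject":       {"noun": "that"},
            "predicate":     {"verbs": ["have"]},
            "direct_object": {"noun": "humidity", "adjectives": ["little"]},
        }
        clause_1 = {                                   # primary clause of the sentence
            "subject":       {"noun": "climates", "relative_clause": clause_2},
            "predicate":     {"verbs": ["are", "called"]},
            "direct_object": {"noun": "climates", "adjectives": ["arid"]},
            "indirect_object": None,
            "complement":    None,
        }
        clauses = [clause_1, clause_2]                 # first entry is always the primary clause
        print(clauses[0]["subject"]["relative_clause"]["direct_object"])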
  • The second stage of the conceptualization process is referred to as the "Emit Process" and entails taking the newly organized information and converting it into neurons and relationals. The process is done by identifying different classes of sentences that can all be processed in the same way. The different classes of sentences are identified based on the structure and the content of the sentence. Different elements within the clause structure are tested to see if they meet certain conditions. The nouns and verbs used will often hold clues that will indicate the conceptual relationships being expressed by the sentence. In order to access these clues, the nouns and verbs of each sentence are preprocessed to derive as much useful information as may be necessary. This information is then made available for easy access when needed. The information stored for each word is derived from the noun or verb info block. The types of easy access information available are:
  • 1) Info Block of noun or verb
  • 2) Flags
  • 3) Type
  • 4) ID
  • 5) Parent gamut (if any)
  • It is from this information that much of the processing decisions are made.
  • In general, the parser is a natural language parser which can utilize many different types of approaches. The type that is used in the present embodiment is a predicate-argument parser using a Backus-Naur Form (BNF) layout description for the language. Actually, while some areas of the parser make use of conventional "predicate argument structure" (PAS) common in linguistics, it is not essential for operation. This BNF form is one of choice and many others could be used and are anticipated such that the use of such is not limiting and other parsers could be utilized to provide the parsing operation.
  • The tokenizer block 704 is the operation wherein incoming sentences are broken down into “tokens,” items corresponding to individual words and punctuation, but in which the words have already been located by some form of internal serial number, if they were already known. Therefore, this simplified list (non-text) is then forwarded to the parser for subsequent processing.
  • The parse block 706 is the device that breaks the sentences into their simpler parts, wherein the parser is defined by a top-down description of the language. In the particular parser algorithm utilized, a Backus-Naur Form (BNF) description of the English grammar is utilized in a natural language C++ parser, which translates English grammar to computer language. Thus, the tokenizer initially breaks the sentence into items corresponding to individual words and punctuation and then the parser breaks the sentences down into their simpler parts and the constituent parts are saved in some manner. This is, in general, a "semantic" parser, and is driven largely by the grammar and syntax of valid sentences. These stacks of constituent parts are then passed on to the conceptualize block 708 to allow them to be converted into clump neurons. A "clump" as defined herein is a form of a neuron that remembers something that relates to action, aspect, or is only true for a specific time, such as "tomorrow." Technically, any phrase—dependent or independent—can be formed into a clump, though that may be a poor use of memory space. Those phrases or clumps that simply define something, such as "all birds fly," are more simply stored as relationships between simple neurons. A large class of information can be stored in this manner, although storing in neurons (not clump neurons) becomes impractical in many cases. In the system described herein, all sentences are formed into clumps; subsequent processes determine if they should be converted to neuron relns and deleted, or be permitted to remain. The main operations of conceptualization are:
      • 1. Resolve classes to instances of neurons (e.g., my parakeet Fiji with an NID of "6921," resolved from the "parakeet" class)
      • 2. Create one or more clumps for each of the parsed history logs or lists.
      • 3. Convert the lists or logs and resolve NIDs into Roles in a clump (wherein Roles will be described herein below).
      • 4. Manage the recycling of not-to-be-used trial clumps (which will be described herein below as temporary groups of neurons that will not be stored for a long duration of time but will be used only for temporary reasons).
  • In general, the parser builds many useful lists of data that include a history of recent content in a context pool (a context pool is comprised of relns recently used, i.e., the history thereof, and which may or may not have a temporal aspect thereto, i.e., they will decay after a period of time if they have not reoccurred in the discourse so as to be reset. This provides some knowledge of the recent history of the discourse.), and some of the utilities are called on by the conceptualize block 708 in order to resolve classes (general families) of neurons into specific neurons relating to the current topic. Some clumps can be reused (shared) by several other clumps as can sentence fragments that occur multiple times, such as prepositional phrases. “Bats fly at night” and “Of course, you may find bats near the bridge this evening because bats fly at night” share a common “semantic” clump. A higher level “controller” clump manages the connection of phrases and concepts to each other whereas the “Outline Clump” summarizes the content of the paragraph. As with regular neurons, clump neurons may be later discarded—not remembered—if not re-validated over a twenty-one-day period (described herein below).
  • Certain types of clumps are purely definitive in nature, and their data is more properly stored in a set of relationships between simple neurons. An example of this would be “The sky and water are blue.” These are definitive statements, as is “My dog has four paws.” This example shows possession, a static condition which can be stored as a central concept. Example statements that can be converted to relns include:
      • 1. Subjective Complements-"These floors, which are wet with rain."
      • 2. Possessive-“My dog has four paws.”
      • 3. Part-ofs-“The engine includes a block, a carburetor, and several manifolds.”
      • 4. Membership-“A cat is a member of the feline species (I think)!”
  • A clump that was converted to relns may be deleted or may be retained for further use by the control or outline clump. After the conversion process, the parse-to-clump process is concluded.
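  • The fragment below is a minimal sketch of that conversion for the possessive case, under the assumption that neuron relns are kept as simple (reln, target) pairs; the POSSN/POSSN_OF names follow the description, while the dictionary storage is illustrative.

        neurons = {"my dog": [], "paws": []}

        def emit_possessive(owner, owned):
            neurons[owner].append(("POSSN", owned))       # stored at the owner neuron
            neurons[owned].append(("POSSN_OF", owner))    # back-reln at the owned neuron

        clump = {"VERB": "have", "ACTOR": "my dog", "THEME": "paws"}   # "My dog has four paws."
        if clump["VERB"] == "have":        # purely definitive: convert to relns and drop the clump
            emit_possessive(clump["ACTOR"], clump["THEME"])
            clump = None                   # the trial clump is recycled rather than retained
        print(neurons)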
  • Neuron Relns
  • As noted herein above, connections between normal neurons in the brain are made through “relationals,” “relns” for short. Relns for other types of neurons (e.g., clump or identity neurons) have special relns and are given other names such as Roles for clump neurons. Each type of neuron has its own numbering sequence (or number space) for enumerating their relational connections.
  • For normal neurons, the relns are divided into two categories, those with weighted connections and those without weights. The structure of a reln in the data space is depicted in FIG. 8. Each reln has an 8-bit field in the Most Significant Bits (MSBs) that specifies the type of reln, this being the Cmd field. The 8-bit Cmd field may be divided into separate Opc and Percentage sub-fields. This would be a 3-bit Opc field and a 5-bit Percentage field. There is provided a single 1-bit field that is associated with an Adhoc flag, and the remaining 23 bits constitute the remainder of the reln. The Adhoc flag defines the bit as a temporary or a real reln, as will be described herein below. The 23-bit neuron field comprises the serial number of the neuron or of the clump. The Cmd field for non-weighted neuron relns utilizes the entire 8-bit field such that a maximum of 255 relns can be defined. This, of course, is up to the system designer as to how relns are defined. Appendix A1 provides for some relns and their respective definitions, it being noted that this is not an exhaustive list and that additional relns are utilized or can be defined. For relns with percentage weights, the 5-bit scaled unsigned percentage field that constitutes the five Least Significant Bits (LSBs) of the Cmd field is used. This leaves a very limited number of command opcodes, limited to 5-6 such relns in total. (Each such reln diminishes the available number of unweighted relns by 32, down from the total of 255 for the non-weighted relns.) Thus, there is a total of 255 relns maximum, with both weighted and non-weighted relns allowed for. Of course, by increasing the length of the Cmd field, this can be remedied. It should be understood that the overall length of the reln could be increased to expand the addressing space.
  • Since only a 5-bit field is used to store the percentage, each count amounts to roughly 3.3%, going from a value of 0% for the bit value of “0” and a value of 100% for a bit value of “31.” Unless otherwise specified by the source data creating a linkage, the weight is set to approximately 50% by default. The various Op codes associated with the weighted neurons for one example of the system is set forth in Appendix A1.
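  • A minimal sketch of that layout, assuming a 32-bit reln word with the 8-bit Cmd field in the MSBs, the 1-bit Adhoc flag, and the 23-bit neuron serial number, is given below; the specific opcode value is a placeholder.

        def pack_reln(cmd, adhoc, nid):
            """8-bit Cmd in the MSBs, then the Adhoc flag, then the 23-bit NID."""
            return ((cmd & 0xFF) << 24) | ((adhoc & 0x1) << 23) | (nid & 0x7FFFFF)

        def pack_weighted_cmd(opc, percent):
            """3-bit Opc plus a 5-bit scaled weight (0 -> 0%, 31 -> 100%)."""
            return ((opc & 0x7) << 5) | round(percent / 100 * 31)

        def weight_of(cmd):
            return (cmd & 0x1F) / 31 * 100          # back to an approximate percentage

        cmd = pack_weighted_cmd(opc=0b010, percent=50)        # default weight of roughly 50%
        word = pack_reln(cmd, adhoc=0, nid=0x1234)
        print(hex(word), round(weight_of(cmd)))               # each 5-bit count is roughly 3.3%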
  • Referring now to FIG. 9, there is illustrated a diagrammatic view of a particular set of relationships between neurons. This represents the state of this particular set of neurons and their relationships at a given time. The relationships have already been created and the neurons are “fixed” for this particular example. As such, this is representative of a static state of the system at a particular moment in time. It should be understood that during receipt of some type of input in the form of a discourse, for example, additional concepts can be introduced or additional relationships between concepts can be introduced. As noted herein above, for example, there could be a discourse that indicated that a leopard, associated with a particular conceptual neuron, had a leash. If this particular relationship did not previously exist, then a new reln would be stored in the leopard neuron to point to the leash neuron, a previously existing neuron by assumption. At the same time, a back-reln would be created, with a POSSN_OF reln stored in the leash neuron pointing to the leopard neuron. Additionally, it may be that not every leopard, i.e., the basic concept, would have this leash. In that situation, a particular leopard by the name of Egor, an instance (INST reln) of the leopard neuron, would possess the leash, such that the Egor neuron and not the leopard neuron possesses it.
  • Returning back to FIG. 9, the relationships associated with a Cat neuron 902 will be discussed. The Cat neuron has multiple direct relationships, i.e., relationships once removed therefrom. The Cat neuron has a parent/child relationship with a Feline neuron 904 such that there is a PARENT reln stored at the Cat neuron 902 pointing to the Feline neuron 904 and a CHILD reln stored at the Feline neuron 904 pointing to the Cat neuron 902. The Cat neuron 902 has a POSSN reln pointing to a Cat Body neuron 906 with a back-reln POSSN_OF pointing back to the Cat neuron 902 from the Cat Body neuron 906. There is also a name that is associated with the Cat neuron, that being “Garfield,” which results in a “Garfield” neuron 908 having an INST reln pointing thereto from the “Cat” neuron 902 with a back-reln of INST_OF pointing from the Garfield neuron 908 to the Cat neuron 902. The Garfield neuron 908 is merely the concept of a particular instance; by looking at its properties, a system can determine the textual word associated therewith. However, the neuron itself is merely the concept of a particular instance of a cat that has associated therewith the various relns and the like. For example, the Garfield neuron 908 has a relationship with an object neuron 910 associated with a red collar, this being the Red Collar neuron 910. This is a POSSN reln pointing from the neuron 908 to the neuron 910 with a back-reln POSSN_OF pointing back thereto from the Red Collar neuron 910. Additionally, the Garfield neuron 908 has a relationship with a neuron 912, being a possession relationship such that a POSSN reln exists at neuron 908 pointing to neuron 912. Neuron 912 is that associated with an object, a white striped tail. Thus, this is a White Striped Tail neuron 912. It has a back-reln POSSN_OF to the neuron 908. As such, the Garfield neuron 908 merely provides an instance of the Cat neuron 902 such that a potential query of “What bears the name Garfield?” will result in a search that will find the Garfield neuron 908 and allow the back-reln to direct the search to the Cat neuron 902.
  • The Cat Body neuron 906 has a plurality of constituent parts, these being neurons 914, 916 and 918 associated with paws, pointy ears and a tail, respectively. The Cat Body neuron 906 has a MADE_OF reln to each of these, and each of these has a back-reln PART_OF pointing back to the Cat Body neuron 906. In addition, the paws neuron 914 has a MADE_OF reln to a neuron 920 associated with claws, with a back-reln PART_OF to neuron 914.
  • As the Cat neuron moves to higher level concepts, the Feline neuron 904, etc., a different higher level hierarchy is reached. However, the relns are not associated with hierarchy; this is merely embedded within the relns themselves. For example, the Feline neuron 904 is related to a higher level neuron, the Animal neuron 924, with a PARENT reln stored at the Feline neuron 904 pointing to the Animal neuron 924 and a CHILD reln stored at the Animal neuron 924 pointing to the Feline neuron 904. Understand that the reln stored at any given neuron defines the type of reln it is, and then the NID associated with that reln defines the particular neuron to which it is related in the defined manner.
  • Additionally, the Animal neuron 924 can have a PARENT/CHILD relationship with respect to a Skunk neuron 926 (it being noted that there are probably different levels of PARENT/CHILD relationships therebetween, but this has been reduced for simplicity purposes). The Skunk neuron 926 has an INST reln to a neuron 928 associated with the name Pepi. This is illustrated to show that the Pepi neuron 928 also has associated therewith a POSSN relationship with respect to the neuron 912 for the white striped tail. Thus, both the Garfield neuron 908 and the Pepi neuron 928 have a relationship to the same white striped tail. Also illustrated is an ASSOC reln between the White Striped Tail neuron 912 and the Tail neuron 918. This is a bidirectional reln showing that there is an association between the two.
  • Referring further to FIG. 9, it is noted that these relationships, in the static mode, allow one to gain a significant amount of information about a particular concept. For example, suppose that a query were made to this particular nodal array inquiring the following: “Tell me about a cat,” and including the search criteria that it include PARENT/CHILD relationships, all instances of the cat and what the cat was made of, at least to one degree. This would return information such as the following: “A cat is a member of the feline category and it possesses a cat body. The cat body is made of paws, pointy ears and a tail. One particular cat is that named Garfield.” As one can see, this required nothing more than to determine the sequence of concepts centering at the Cat neuron 902 and to follow the relationships outward therefrom to the other concepts by only a single reln step, i.e., one degree of search.
  • Referring now to FIG. 10, there is illustrated a more simplified diagram, illustrating the relationships up to at least the Feline neuron 904. Each of the neurons illustrated in FIG. 10, leaving out the Skunk neuron 926 and the Pepi neuron 928 in addition to the Tail neuron 918, is provided with a neuron number. As illustrated, each neuron number is underscored to distinguish it from a reference numeral. The Cat Body neuron number is “3011,” the Cat neuron 902 has an NID “2031” and the Feline neuron 904 has an NID “1003.” For example, the inquiry could be “Tell me everything about a cat.” This would be translated to the NID “2031.” This would then produce a response “A ‘2031’ is in the category ‘1003’ and possesses a ‘3011.’ The ‘3011’ is made up of a ‘3005’ and a ‘3101.’ There is a ‘2031’ that exists that is named ‘7041.’” All that is required is to utilize the properties portion of the particular neuron to determine the text associated with that particular neuron to complete the query.
  • Referring now to FIG. 11, there is illustrated a diagrammatic view of how the various neurons illustrated in FIG. 10 are related to each other. Each of the concept neurons has associated therewith, as described herein above, an NID. Also associated therewith, in addition to the properties aspect (not shown), are the relns. Each of the relns constitutes the type of reln it is in addition to the NID of the concept neuron to which it points. For example, the Cat neuron 902 is assigned an NID of “2031.” This has a POSSN reln associated therewith which points to the neuron “3011,” which is the NID of the Cat Body neuron 906. Additionally, the Cat neuron NID “2031” has associated therewith an INST reln pointing to NID “7041.” This is the NID of the Garfield neuron 908. Since there is a back-reln associated with each of these, the Cat Body neuron 906 has a POSSN_OF reln pointing to neuron “2031,” the Cat neuron 902. Similarly, the Garfield neuron NID “7041” has an INST_OF reln pointing to NID “2031,” the NID of the Cat neuron 902. Similarly, all of the other neurons have the various relns pointing to different neurons. By looking at the relationships, various queries can be answered. For example, the question might be “What type of animal might have a red collar?” This would merely require the system to go to the neuron 910 and look at the back-reln to go all the way back to a neuron having a PARENT/CHILD relationship that led back to the Animal neuron 924. This will result in the neuron 902. This can quickly be done through the use of the relational interconnections and then by merely looking up the text associated with the neuron 902 and providing the answer, “This is a cat.” Another query could be “What does Garfield possess?” This would merely require going to the neuron 908 and going back to any neuron that would have the closest MADE_OF reln associated therewith. This would come back with the answer that it possessed “a cat body with paws, claws, pointy ears and a tail.”
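  • The query patterns described for FIGS. 9-11 can be pictured with a small in-memory graph. The following Python fragment is a sketch only: the Neuron class, the link helper and the red-collar NID (7100) are invented for illustration, while the other NIDs and reln names follow the figures. It shows how storing a reln together with its back-reln lets a query such as “What type of animal might have a red collar?” walk from the Red Collar neuron back through POSSN_OF and INST_OF links to the Cat neuron.

```python
# Sketch of concept neurons connected by relns and back-relns, as in FIGS. 9-11.
# NIDs 1003, 2031, 3011 and 7041 follow FIG. 10; the red collar NID is invented.

class Neuron:
    def __init__(self, nid, text):
        self.nid, self.text = nid, text
        self.relns = []                          # list of (reln_type, target_nid)

neurons = {}
def make(nid, text):
    neurons[nid] = Neuron(nid, text)
    return neurons[nid]

feline   = make(1003, "feline")
cat      = make(2031, "cat")
cat_body = make(3011, "cat body")
garfield = make(7041, "Garfield")
collar   = make(7100, "red collar")              # assumed NID for illustration

def link(src, fwd, dst, back):
    """Store a reln at src and the corresponding back-reln at dst."""
    src.relns.append((fwd, dst.nid))
    dst.relns.append((back, src.nid))

link(cat, "PARENT", feline, "CHILD")
link(cat, "POSSN", cat_body, "POSSN_OF")
link(cat, "INST", garfield, "INST_OF")
link(garfield, "POSSN", collar, "POSSN_OF")

def owner_class(nid):
    """Walk POSSN_OF / INST_OF back-relns until a class-level concept is reached."""
    for reln, target in neurons[nid].relns:
        if reln in ("POSSN_OF", "INST_OF"):
            return owner_class(target)
    return neurons[nid]

# "What type of animal might have a red collar?"  ->  collar -> Garfield -> cat
print(owner_class(collar.nid).text)              # prints "cat"
```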
  • As noted in the index, there are different types of relns. Some are parent-like, some are child-like and some are sibling-like. The parent-like relns are relns such as PARENT, PART_OF, INST_OF and POSSN_OF. This means they point back to a higher hierarchical order. The child-like relns would be those such as CHILD, INST, MADE_OF and ATTRIB. The sibling-like relns would be those such as ASSOC and NAT_ATC.
  • In another query, suppose there is some issue as to whether Pepi is related to Garfield. This would require the neuron 928 and the neuron 908 to be searched backwards to a common PARENT/CHILD relationship, i.e., it would follow the parent-like link back to a common concept neuron. When it arrived at the common concept neuron, this would constitute the relationship.
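  • The relatedness query just described (whether Pepi is related to Garfield) amounts to following parent-like relns from each neuron and looking for a concept neuron that both paths share. A minimal sketch of that search, assuming the same Neuron class and neurons table as in the previous sketch, might look like this:

```python
# Sketch: find the common concepts shared by two neurons by walking parent-like
# relns (PARENT, INST_OF, PART_OF, POSSN_OF), as in the Pepi/Garfield query.

PARENT_LIKE = ("PARENT", "INST_OF", "PART_OF", "POSSN_OF")

def ancestors(nid, neurons):
    """All NIDs reachable by repeatedly following parent-like relns."""
    seen, stack = set(), [nid]
    while stack:
        current = stack.pop()
        if current in seen:
            continue
        seen.add(current)
        for reln, target in neurons[current].relns:
            if reln in PARENT_LIKE:
                stack.append(target)
    return seen

def common_concepts(nid_a, nid_b, neurons):
    """Concept neurons shared by the parent-like closures of the two neurons; for
    the Pepi and Garfield neurons of FIG. 9 this would contain the Animal neuron."""
    shared = ancestors(nid_a, neurons) & ancestors(nid_b, neurons)
    shared.discard(nid_a)
    shared.discard(nid_b)
    return shared
```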
  • The concept of “Clumps” is defined as a neuron that contains a reference to a verb (a requirement) and that takes a plurality of concepts, be they words or phrases of a sentence or otherwise, and converts them to a series of semantic roles. As such, it is capable of capturing the knowledge embodied within an independent clause. With respect to clumps of sentences, this is again governed by the role of those sentences from a semantic standpoint. It should be understood that the term “relational interconnections,” when used with concept neurons, defines a relationship between one neuron and another neuron. In a similar manner, a “role” is still a relationship between a clump neuron and all of the concept neurons that make up the clump. The role defines the relationship of each of these neurons in the clump or of each of the sentences within a clump. To understand the concept of clumps, it is first necessary to determine how clumps are formed, and such a discussion follows.
  • The Predicate Argument Structure (PAS)
  • Central to any sentence or thought is the main verb. It is the driver of the sentence. For any given sentence, selecting a different main verb from that sentence can lead to a drastically different meaning. The main verb assigns semantic “roles” or “responsibilities” to the various grammatical constituents, and when that verb changes, the entire sentence changes. The verb can occur in a certain manner, at a particular time; it can carry a theme; and there can be a main agent or something that experiences the verb's action. Modifiers such as roles, experiences and locations enable the transfer of words to concepts. The words themselves are not stored; the concepts behind the words are.
  • The PAS consists of multiple different roles that can be assigned to any given verb. Examples of these roles are as follows:
      • Agent: “Georgio” painted the actors in the nude.
      • Experiencer: The dog caught the “Frisbee.”
      • Time: “In the night,” the creatures came out to play.
      • Manner: The chicken “quickly” crossed the road.
      • Place: I like to eat hot dogs “at the ball park.”
      • Topic: Snakes claim “that Chinese cooks are dangerous.”
        With the PAS information for verbs, the parser is able to understand the unique relationships that can occur between verbs and the roles, or responsibilities, that they may assign.
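  • The example sentences above can be pictured as a small table mapping each role label to the quoted constituent that fills it for a given verb. The sketch below is illustrative only; the assumed pas_frames entry at the end shows the shape of the per-verb information (which roles a verb is allowed to assign) that the parser consults.

```python
# Role labels paired with the quoted constituents from the example sentences above.
pas_examples = {
    "agent":       ("paint", "Georgio"),
    "experiencer": ("catch", "the Frisbee"),
    "time":        ("come",  "in the night"),
    "manner":      ("cross", "quickly"),
    "place":       ("eat",   "at the ball park"),
    "topic":       ("claim", "that Chinese cooks are dangerous"),
}

# A PAS entry for a verb can then record which roles that verb may assign.
# The frame below is an assumption for illustration, not the actual lexicon entry.
pas_frames = {
    "claim": {"agent", "topic", "time", "manner", "place"},
}
```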
  • The various roles can be assigned at different levels. At the sentence level, “sentence roles” are applied. These are in addition to the Semantic Roles assigned by the verb. At the sentence level, the Sentence Roles are frequently used to connect thoughts. Correlative pairs, such as “if-then,” will establish a unique relationship between sentences. Adverbial conjunctions such as “however” denote contrast to a previous statement. These play a critical role in understanding relationships between thoughts.
  • Natural Language Parser
  • Referring now to FIG. 12, there is illustrated a diagrammatic view of the parser process for taking an input from tokenized text to the clumps. This is referred to as the “Natural Language Parser.” The parsing of natural language text is one of the most difficult challenges in computing. The subtleties, nuances, innuendos, idioms, dynamic nature and ambiguities are but a portion of the quest to accurately break down natural text into its relevant and intended meaning. Behind every text is a conceptual thought, as described hereinabove. The parser discerns the thought. Through breaking down sentences into their conceptual parts, analyzing, topic tracking, and an awareness of context, previously daunting sentences are handled. Additionally, in parsing any language one is faced with multi-tiered layers of complexity. The grammatical side of the parse, as difficult a task as it is to do accurately, is only a part of the task as a whole. Grammatically speaking, a sentence has a format and syntax it adheres to. Generally, any given language will have a rule set that is for the most part followed. English sentences flow from subject to verb to object. However, rules are constantly broken and improper grammar is frequently used. Yoda says, “Grave danger you are in. Impatient you are.” His first sentence is object-subject-verb. In common English he is saying, “you are in grave danger.” Likewise, “Impatient you are.” is object-subject-verb, translated as, “You are impatient.” These variances must be accounted for. As readers, one is able in many cases to discern the meaning of a text passage even if it was poorly written or craftily crafted. After a grammatical parse is discerned, some process needs to actually understand it. It is one thing to know the main verb and subject. It is another to know and understand exactly what is happening, where it is happening, how it is happening and how it relates to the greater context of the moment.
  • As noted herein above, the basic flow of the parser involves the following steps (a minimal code sketch of this flow is provided after the list):
      • 1. Tokenization
      • 2. Pre-Rules Layer
      • 3. PAS Verb Selection
      • 4. Post-Rules Layer
      • 5. Grammatical Parse
      • 6. Role Resolution
      • 7. Scoring
      • 8. Conceptual ‘Clumping’
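  • A minimal skeleton of this flow, written in Python, is shown below. Every helper is a trivial placeholder stub (the real layers are described in the blocks that follow), and only the control flow is meaningful: candidate main verbs are tried in turn, each resulting parse is scored, and a fragment parse is the final fallback.

```python
# Skeleton of the eight-step parse flow listed above. The stubs are placeholders;
# only the overall control flow (try verbs, parse, score, fall back) is illustrated.

def tokenize(text):                return text.split()          # 1. Tokenization
def apply_pre_rules(tokens):       return tokens                 # 2. Pre-Rules (run once)
def rank_pas_verbs(tokens):        return list(tokens)           # 3. candidate main verbs
def apply_post_rules(tokens, v):   return tokens                 # 4. Post-Rules, given verb
def grammatical_parses(tokens, v): return [tokens]               # 5. recursive parse
def resolve_roles(parse, v):       return {"ROLE_CLUMP_VERB": v} # 6. roles from grammar
def score_parse(roles):            return 1.0                    # 7. PAS/Role/Context layers
def parse_as_fragment(tokens):     return {"fragment": tokens}   # desperation fallback
def build_clump(roles):            return roles                  # 8. conceptual clump

def parse_sentence(text):
    tokens = apply_pre_rules(tokenize(text))
    best = None
    for verb in rank_pas_verbs(tokens):
        marked = apply_post_rules(tokens, verb)
        for parse in grammatical_parses(marked, verb):
            roles = resolve_roles(parse, verb)
            score = score_parse(roles)
            if best is None or score > best[0]:
                best = (score, roles)
    return build_clump(best[1]) if best else parse_as_fragment(tokens)

print(parse_sentence("The rabbit may quickly jump over the carrot"))
```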
  • The tokenization is initiated at a block 1202 to start at the most basic level. The target text must be prepped prior to attempting to parse it. A tokenizing process is applied to break the text into basic groupings. In language, these are often called words, i.e., the logical semantic element. These words do not have to be official words, as they could be an unknown collection of letters or a sequence of symbols. These “tokens” are the input that drives the parsing process.
  • The next step in the process is associated with the Pre-Rules Layer, as indicated by a block 1204. The Pre-Rules layer marks all the relevant tokens with flags that provide clues to the later grammatical parser. For each sentence, the Pre-Rules need only be executed or run one time. They are not changed due to different verb attempts, because they hold true no matter what the main verb ends up being.
  • The next step is the PAS Verb Selection at block 1206. The Predicate Argument Structure (PAS), or the main verb, is selected through a scoring system. The scoring system determines which possible verbs to try. Regardless of success, other options will also be selected and tried due to the recursive nature of the parser. For any given parse attempt, the PAS Verb selected is the main verb. Going forward, the parser assumes this to be true and proceeds as if it were so. This enables the parser to avoid the complexities of constantly attempting to resolve the issue during the grammatical parse.
  • The next step is the Post-Rules Layer at block 1208. Post-rules are applied to the input tokens according to the assumed selected PAS Verb. In English, there are rules that can be applied once the verb is discerned. Since the parser assumes the main verb, in any given parse, the main verb has been discerned.
  • The next step in the process will be the grammatical parse, as represented by block 1210. This is a recursive process. When parsing text, there are many “decisions” that have to be made. Many words can operate as multiple word types. Improper grammar and punctuation are often used, and that cannot prevent the parser from its task. “Decision Nodes” have been implemented that track these decisions throughout the course of the parse. An example of a decision node is the following:
      • The Cops claimed that criminal.
  • A decision point occurs after the main verb “claimed.” The PAS data for the verb “claimed” indicates that “claimed” assigns a role of “theme.” This theme represents the verb “claim.” As a theme, the entire role itself can be a nested clause with its own PAS verb. At the point “that” is encountered, the grammatical parser cannot be certain if a nesting clause exists, if “that” is a relative pronoun, if it is an irrelevant keyword, or if “that” is a determiner. A nested clause is referred to by linguists as a “CP” or complementizer phrase. Complementizers can have their heads, or words that will lead them off, or they can be assumed. These cases would look as follows:
      • The cops claimed that: All Relative Pronoun Theme
      • The cops claimed that criminals are dangerous: Nested Theme CP w/ CP head.
      • The cops claimed that criminal is dangerous: Nested Determined Theme CP w/ no CP head.
      • The cops claimed that criminal: Determined Target
        A decision node is needed at: “The cops claimed that . . . ”
  • The decision node stores an enumerated set of information regarding the decision. Nodes are loaded with the realm of possibility. Decision logic determines which possibility to choose, and it records that choice in a log. Some nodes lead to ambiguity, while others do not. Upon failure, or success, of any given parse, all ambiguous nodes will be chased. Essentially, the other choices are made and the parser attempts to parse that particular version. In handling decisions in this manner, the parser's hands are clean. There is really no decision, because all decisions that lead to a valid parse are valid and acceptable at this stage.
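  • One way to realize the behavior described above is a simple backtracking search: each decision node enumerates its possibilities, each choice is logged, and, upon failure or success of a given parse, the remaining ambiguous choices are chased so that every valid parse can compete in scoring. The sketch below is a generic illustration of that pattern, not the actual parser code.

```python
# Generic sketch of decision nodes with exhaustive "chasing" of ambiguous choices.
from itertools import product

def chase_decisions(decision_nodes, try_parse):
    """decision_nodes: one list of possibilities per decision point.
    try_parse: returns a parse for a tuple of choices, or None if it fails."""
    valid = []
    for choices in product(*decision_nodes):
        parse = try_parse(choices)
        if parse is not None:
            valid.append((choices, parse))     # every valid parse competes later
    return valid

# Toy example: after "The cops claimed that ...", the token "that" could be read
# as a relative pronoun, as the head of a nested CP, or as a determiner.
nodes = [["relative pronoun", "CP head", "determiner"]]
candidates = chase_decisions(nodes, lambda c: f"parse with 'that' as {c[0]}")
print(len(candidates))                         # 3 candidate parses to score
```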
  • The next step in the process is role resolution. In role resolution, the grammatical roles are converted to their PAS Role counterparts. A subject may become an actor, an experiencer, etc. The PAS verbs are defined through the use of various rules that are set subject to linguistic guidelines. These roles must then be scored, with this scoring viewed as a competition. This is facilitated at a block 1212. The valid grammatical parse options are the competitors vying for the parse. There are multiple layers upon which the competition can be judged, as follows:
      • 1. PAS Layer
      • 2. Role Layer
      • 3. Context Layer
        A score is calculated and the players compete. The highest score wins in this instantiation. If there is no viable option, then the parser will fall into a series of desperation modes. These modes change the way the pre-rules work and gradually become less restrictive. A sentence like “The is my favorite determiner.” would parse once certain grammatical restrictions were loosened. The final attempt, if all else fails, is to parse the sentence as a fragment.
  • Although the grammar+PAS parser has been discussed herein, it should be understood that other parsers could be used. Grammatical parsing could be used, statistics-based parsing (i.e., the type typically found in Google® type systems) could be used, or just the basic PAS parser could be used.
  • Conceptual Clumps, as described herein above, are a collection of concepts, wherein words are used to convey concepts. This is a collection of concepts that come together to form a thought. The output of the parser is a single clump at a block 1214 that is neatly stored in its conceptual form. During the scoring operation, a fragment may be returned, a desperation attempt may be taken, etc. This is illustrated by blocks 1216, 1218 and 1220, each taking a separate path back to the initial tokenizing block 1202 to pass through the process one additional time.
  • During the grammatical parse operation, there can be ambiguities, as there can be at the verb selection. This ambiguity is resolved at an ambiguity node 1222. If the grammatical parse has been acceptable, it will pass to the ambiguity node 1222. Once any ambiguities are cured, this will pass back to the PAS Verb selection block 1206 and then to the scoring block 1212. The ambiguity nodes block 1222 determines if there are ambiguities and, if so, it will pass to the last ambiguity node and then back to the grammatical parse. If the grammatical parse fails, it will also pass back to the PAS Verb selection block. With respect to ambiguities, there are two major types of ambiguity: lexical and structural. Lexical ambiguity occurs when one word can mean different things. Technically, the homograph can be interpreted differently. Words like bank, jump, chair, or cup all have multiple meanings and uses. An example of such is “American chair seeks arms.” There are actually two lexical ambiguities here. The first: is “chair” a chair of the American variety, or is “chair” the head of something that is American, e.g., a leader or head-of-state? The other ambiguity is whether these “arms” are parts of a body or parts of a chair. In general, lexical ambiguities arise when words can function as two or more parts of speech. By comparison, structural ambiguity occurs when the attachment of a phrase can be misapplied, such as in the phrase “He painted the actors in the nude.” The question would be whether the man was painting while nude or whether the actors he painted were nude. Generally, context can resolve structural ambiguity. Was the man known for painting nudes? Was he known to be an eccentric or radical artist? Is this statement being tied to another that may help one's understanding?
  • Various ambiguous combinations, bad punctuation, complex sentences and unknown words can lead to a wide variety of grammatical parses. For example, the phrase “Chocko recognized Zools fly in the night” may have as many as fifteen different meanings. It could mean “in the night, Chocko ‘recognized’ a certain type of fly,” or “Chocko ‘recognized’ Zools [that] fly (flying Zools).”
  • As noted herein above, the clump basically gathers information about an action that took place, together with all the history of that event. Such clumps are repositories for the actions taken by nouns, and each such clump implies what can be viewed as the content of an independent clause, with a variant handling dependent clauses. Thus, a Conceptual Clump stores the parsed “thought,” and is generally used to hold the verb of the sentence, although it need not be. For convenience, clump neurons have their own number space, as noted herein above.
  • At the basic level, the clump takes the words and phrases of the sentence and converts them to a series of semantic “roles.” Three types of semantic roles drive the basic sentence clump. The primary is the PAS Verb (ROLE_CLUMP_VERB). It is the main verb that assigns most of the other roles stored in the clump. It is important to note that different PAS verbs will assign different roles. Some verbs are not able to assign certain roles, and many times the roles that are assigned are restricted in various ways. These restrictions provide an aid in scoring a parse, as they will help ensure an accurate sentence reconstruction. In addition to the main verb is the ROLE_TASPECT. It contains the tense and aspect that the PAS-assigning verb utilized.
  • The last driving role at the basic sentence level is captured with one or more of the five modal roles: ROLE_M_INTENT, ROLE_M_ABILITY, ROLE_M_OBLIGATION, ROLE_M_DESIRE and ROLE_M_POSSIBILITY.
  • With these three roles, the PAS verb, TASPECT and MODAL, it is possible to reconstruct the verb, the tense, the aspect and the modality. Something like “Wanted to be jumping” could be captured with relns. The full sentence, “The rabbit may quickly jump over the carrot,” breaks down into Clump 2001 as illustrated in FIG. 13. In this clump, it can be seen that there are a plurality of roles that are associated with a resolved neuron ID (NID). For example, the role, ROLE_CLUMP_VERB, is resolved to be associated with the “jump” NID. The role, ROLE_TASPECT, is resolved to be associated with the PRES NID. There is provided a role, ROLE_M_POSSIBILITY, that is associated with the “may” NID, a role, ROLE_ACTOR, associated with “the rabbit” NID, a role, ROLE_EXPERIENCER, associated with “the carrot” NID, and a role, ROLE_MANNER, associated with the concept “quickly” NID. There is also provided a command field that is a 2-bit field located in every role of the clump. Its primary purpose is to allow reusability of clumps. Were clumps to proliferate, they would consume significant memory space over time. To mitigate this, a method is implemented to permit reuse of existing clumps, such as sharing a clump that is close to what is needed. A base clump can be shared as a result to save storage space if a new clump differs by only “n” roles (where “n” is determined by a function that decides if it actually saves any time/space through reuse of the clump). An alternative to this, and possibly a preferable one, is to minimize creation of these extra clumps, using linked list structures to hold the would-be content of the clumps. The worth of each such (tree-like) structure branch would then be evaluated to determine which one should be selected to be output as a clump neuron.
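  • The clump of FIG. 13 can be pictured as a small role table mapping role labels to resolved NIDs. In the sketch below, only the role names and the example sentence come from the figure; the NID values and the encoding of the 2-bit per-role command field are invented for illustration.

```python
# Role table for the clump of FIG. 13: "The rabbit may quickly jump over the carrot."
# NID values are invented; ROLE_NORMAL is an assumed encoding of the 2-bit command field.

ROLE_NORMAL = 0b00

clump_2001 = {
    "ROLE_CLUMP_VERB":    {"nid": 5001, "text": "jump",       "cmd": ROLE_NORMAL},
    "ROLE_TASPECT":       {"nid": 5002, "text": "PRES",       "cmd": ROLE_NORMAL},
    "ROLE_M_POSSIBILITY": {"nid": 5003, "text": "may",        "cmd": ROLE_NORMAL},
    "ROLE_ACTOR":         {"nid": 5004, "text": "the rabbit", "cmd": ROLE_NORMAL},
    "ROLE_EXPERIENCER":   {"nid": 5005, "text": "the carrot", "cmd": ROLE_NORMAL},
    "ROLE_MANNER":        {"nid": 5006, "text": "quickly",    "cmd": ROLE_NORMAL},
}
```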
  • As noted herein above numerous times, a single neuron represents a single concept, and connections to other neurons establish its meaning, representation and purpose within the context of the entire database. The clumps, on the other hand, utilize “semantic roles” to capture the history of some aspects of the concept within the brain. Establishing these semantic roles for nouns and verbs allows the system to more easily restate things. The allocation of roles (and rules for their use) to words is used alongside an existing hierarchy of words. With or without them, the system can choose alternative word-renderings of the same concept, but roles make that choice easier. The semantic roles thus act as a “glue” that takes the meaning implied by an existing neuron interconnection, with or without hierarchy, and guides its transformation into sentences where needed. Before proceeding further, it may be helpful to describe the general concept of semantic roles in terms of the PAS.
  • The PAS Semantic Roles
  • In general, all nouns in a sentence are considered to be arguments to a verb wherein each verb assigns one label, called a semantic role, to each noun. The total of semantic roles for a particular verb is the Predicate Argument Structure, or PAS for that verb. Again, note that PAS is just one of multiple parsers that could be utilized herein and it is anticipated that other parsers could be utilized.
  • In general, the goal of this system is to separate the semantics, or meaning, from the syntax, or form (such as word order). This system is designed to handle the various ways that one can change word order or even tense in English, but retain the same meaning. There are provided three examples of this word-order change. They are as follows:
  • Active vs. Passive Sentences:
      • The sentences “I ate the cookie” and “The cookie was eaten by me” mean the same thing. Both of them have the eatee and the eater, but the word order is different. The PAS model tags the eater as the Agent, and the eatee as the Experiencer, regardless of the order of the words.
  • Ditransitive or VG verbs.
      • Ditransitive verbs are verbs such as “give” which can take two objects, as in “Moses gave ‘Miriam a cookie.’” All ditransitive verbs may also be used as regular transitive verbs, with an additional prepositional phrase, as in “Moses gave ‘the cookie to Miriam.’” In both forms, however, Miriam is still the person who has the cookie at the end, Moses is still the person who gave it to her, and the cookie is still the thing that was given.
      • There is also another usage of the ditransitive, and though it may be handled the same way in practice, it warrants special enumeration. In some dialects only, words which are normally not ditransitive may become so in order to denote the beneficiary. For instance, “I'm going to paint ‘me a fence,’” or “I'm going to kill ‘me a bear.’” The final form, called the attributive ditransitive usage, occurs only in verbs which attribute one object with another object or with a quality. For instance, “You shall call ‘his name John,’” or “Let me make ‘you a nice supper,’” or “Octavia will call ‘Isabel a cab.’” This can be confusing at times: will Octavia now refer to Isabel as a cab, or is Octavia calling a cab for Isabel's benefit?
  • Ergative Verbs
      • Ergative verbs behave differently if they are used transitively or intransitively. If they are used intransitively, they mean that the action was done to the subject. However, if they are used transitively, they mean that the action was done “by” the subject to the direct object. For example, in both of the sentences “The alien ray gun liquefied the green chair” and “The green chair liquefied,” the same object was liquefied, the green chair, even though it was the object in the first sentence and the subject in the second. It is worth noting that the second sentence is not the passive of the first. The passive would be “The green chair was liquefied by the alien ray gun.”
  • There are three categories of roles provided: Universal Roles, Regular Roles, and Specially-Assigned Roles. Universal Roles are roles that may be used in any sentence no matter what the verb type; Regular Roles are roles that must be assigned by the verb and may not appear in just any sentence; and Specially-Assigned Roles are roles that may only be assigned by certain kinds of verbs. These roles are defined in Appendix B.
  • Returning to the clump definitions, in addition to the simple clump described in FIG. 13, there are also provided Nested Sentence Clumps. These are slightly more complex than a basic clump in that this type of clump contains a nested clump functioning as an entire role assigned by the PAS verb. Sentences with relative adjectives (Radjs), nested WH roles, and gerunds are examples of this. Consider, for example, the sentence “The rabbit that bit me may quickly jump over the carrot.” This illustrates clump “2001” referring to clump “2002” in the role, ROLE_ACTOR. This shows the additional role, ROLE_CLUMP_VERB, as being associated with the word “bit” and the role, ROLE_EXP, as being associated with the word “me.” Sometimes, however, relative adjectives (Radjs) will not need to be stored as nested clumps, even though that is how they will grammatically parse. A Radj is used to determine the noun. It is helpful in resolving a noun by determining “which one” is being considered. However, if the clump the Radj forms is new information, it should be stored. The system will either resolve a noun or it will have to create a new concept that awaits resolution with a high priority flag.
  • Conjoined Sentence Controller Clumps
  • The flow of thought usually ties sentences together with conjunctions, or through punctuation functioning as conjunctions (assumed conjunctions). These are at the sentence level and are used to coordinate, subordinate, or establish a condition (if/then). An explicit example of this is “Luke wanted peanuts, however he got cashews.” An assumed example of this is “Luke wanted peanuts. He got cashews.” One could choose to make two standard clumps and store a forward sentence role from one and a back link to the other, but that might be considered restrictive. There are various forms the two thoughts could come through, and if the system is allowed to pick any particular one to standardize on it might run into complexity issues that the system should not have to deal with (i.e., “Luke got peanuts but he wanted cashews.”).
  • A more accurate way to store the information is with a Controller Clump at a higher level, as described herein above. The assumption of the controller is that it progresses linearly, i.e., it is a linear flow of concepts. Connected clumps are stored in a linear fashion. If an ordering is not explicitly expressed, the system will store them in a logical progression (i.e., Want, then Realization; Expectation, then Reality). The controller is the essence of the connected thoughts, and this is illustrated in FIG. 15.
  • In FIG. 15, the clump “2001” is defined as the Controller Clump. This has two roles, the SENT_SEQ role and the SENT_CONTRAST role. The first role is assigned or points to the resolved NID of clump “2002,” and the second points to the resolved NID of clump “2003.” Clump “2002” has a plurality of roles associated therewith. This is for the words “want,” “Luke,” and “peanuts.” The verb “want” is the main verb for this clump and occupies the role, ROLE_CLUMP_VERB. The word “Luke” takes the role of the actor and the word “peanuts” takes the role of the experiencer. The owner is the Controller Clump, clump “2001.” The clump “2003” is associated with the words “got,” “Luke,” and “cashews.” The main verb in this clump is the verb “got,” the actor is the word “Luke,” and the experiencer is the word “cashews.” The owner, again, is clump “2001.” With this arrangement, storing at the higher level will allow for a greater diversity of expression on the output side and also for a strong ability to match concepts in the analyzing process (such as for answering questions).
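  • FIG. 15 can likewise be sketched as three small role tables, with the Controller Clump pointing at the two sentence clumps through the sentence-level roles. The clump IDs and role names follow the figure; the dictionary layout and the OWNER entry are illustrative only.

```python
# Controller Clump arrangement of FIG. 15 for
# "Luke wanted peanuts, however he got cashews."

clump_2002 = {"ROLE_CLUMP_VERB": "want", "ROLE_ACTOR": "Luke",
              "ROLE_EXPERIENCER": "peanuts", "OWNER": 2001}
clump_2003 = {"ROLE_CLUMP_VERB": "got", "ROLE_ACTOR": "Luke",
              "ROLE_EXPERIENCER": "cashews", "OWNER": 2001}

controller_2001 = {
    "SENT_SEQ":      2002,   # the first thought in the linear flow
    "SENT_CONTRAST": 2003,   # the contrasting thought introduced by "however"
}
```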
  • Referring now to FIG. 12a, an exemplary flow diagram illustrating linking between clumps is shown. The system has various types of neurons, of which Experience neurons and normal neurons link to clumps. As to “Experience neurons,” all other neurons capture knowledge, relationships and facts, but the Experience neuron captures the emotions, timing and expectations of an experience. In this way, this neuron defines the essential concept of an experience. For convenience, the neuron IDs for experiences have their own numbering space. As such, Experience neurons capture a further level of detail and nuance that comes with first person experience. The core of an experience is a clump; thus it will be linking to clumps. All normal neurons are representative of a “concept” and from these concepts, there is linkage to groupings of concepts (the clump). “A dog that jumped a road, ate a bird and then went to sleep,” links to a controlling clump representing the entire sequence.
  • Each concept that played a role will have a link to the clump it played a role in. The bird clump can get back to the cause of its clump by going to the controller clump. A “conceptualizer” logic decides what to do with the clump. It may need to be broken down and stored as class-level properties, or even as a process neuron.
  • Emotional impact of any clump can be stored by adding a ROLE_EMOT reln in the clump. For example, a particular statement of a particular political speech may have incited anger, and that could be stored in the exact offensive sentence level clump. However, the speech itself may not have been offensive, thus the “political speech” clump would not get the ROLE_EMOT.
  • Another asset is the ability of Experience neurons to point to clumps. If the experience of getting angry met the criteria, it may get its own Experience neuron (‘exper’). In this case we would link the exper to the clump and have the option of storing an EXPER_EMOT in the experience.
  • In the end, clumps are a collection of concepts/thoughts, and by interconnecting thoughts with neurons and assigning emotional impact to them we increase flexibility of the system and we are one step closer to emulating the human brain.
  • Outline Clump
  • The Outline Clump (OC) is much like the table of contents for a book. It gives the overall purpose or subject of a discussion and provides a Table-of-Contents-like layout of the material. A single OC, for example, summarizes the contents of a training document the brain has absorbed. Another might outline a movie or an experience such as my “Hawaiian Vacation of 1999”.
  • The OC is a holding place for a series of Controller Clumps (CCs) arranged in a hierarchical fashion. At the lowest level, sentences are represented by Semantic Clumps (SCs). Each paragraph, for example, is managed by a Controller Clump that includes a number of SCs and a final summary SC. If one is dealing with a book being read, higher-level CCs are arranged to account for sections of a chapter. Above them, an Outline Clump (OC) orders the chapter CCs into an overall book. It is possible to discard all the lower-level semantic clumps (SCs) except for the summary clumps, if desired, to provide a synopsis of the book. Each CC and OC has a summary clump attached to it, regardless of the level in the structure. For example, a “section” CC has below it a summary SC for every paragraph. The section itself has a summary SC that is a “roll-up” of the SCs for all the paragraphs below it. Finally, the Outline Clump (OC) has a summary SC that summarizes what the book itself is about.
  • The OC differs from other types of clumps in that it can wrap up both textual or discussion content as well as experiences (the “expers,” as discussed hereinabove). The knowledge in an Experience neuron is organized quite differently from clumps and conceptual neurons. It includes such things as expectations and anticipations (met or not) and various emotional and time stamps that relate to the experience. The OC has the least information of all clumps but contains the highest-level synopses of multiple types of knowledge. It is truly an outline or staging point from which to recall knowledge from many levels.
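  • The OC/CC/SC layering can be pictured as a tree in which every level carries its own summary clump, so that keeping only the summaries yields the synopsis described above. The structure and field names below are a sketch, not the stored format.

```python
# Sketch of the hierarchy: OC -> chapter CCs -> paragraph CCs -> sentence SCs,
# with a summary clump kept at every level. Field names are illustrative.

book_outline = {
    "type": "OC", "summary": "summary SC for the whole book",
    "children": [
        {"type": "CC", "level": "chapter", "summary": "chapter 1 summary SC",
         "children": [
             {"type": "CC", "level": "paragraph", "summary": "paragraph 1 summary SC",
              "children": ["SC for sentence 1", "SC for sentence 2"]},
         ]},
    ],
}

def synopsis(node):
    """Collect only the summary clumps, discarding the low-level sentence SCs."""
    if isinstance(node, str):
        return []
    return [node["summary"]] + [s for child in node["children"] for s in synopsis(child)]

print(synopsis(book_outline))
```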
  • Clump Management
  • The use (and reuse) of clump neurons must be explicitly managed. A proposed new clump neuron may be similar to an existing clump neuron (“clump”). It may be shorter to create a MAKE_FROM neuron using the original clump as a pattern or template, rather than creating the full clump that largely replicates an existing item. The process for creating a non-duplicate clump takes the following steps:
      • 1. Look for matches.
      • 2. If no match, create a new clump.
      • 3. If there is an exact match, return the clump ID of the matched neuron.
      • 4. If there is a close but non-perfect match, use an existing clump as a make-from template.
        In this way, the system maximizes the use of its memory resources.
  • When creating new clumps, it is necessary for the system to first determine if a similar one exists. Various degrees of matching between the proposed new clump and an existing clump are possible. A “matched” clump is one that has an identical role table set. That is, all of the roles are the same, and the role values are the same (the neuron IDs the roles point to). Alternatively, a partial match can exist wherein some roles match up exactly, but others do not. The system must then determine what threshold will create a new clump vs. reusing an existing one. As an example, consider the phrases “The rabbit bit Thomas,” “The rabbit bit Luke,” and “Skip bit the rabbit.” This is illustrated in FIG. 16. This defines the first sentence with the verb being “bit,” the actor being “the rabbit,” and the experiencer being “Thomas.” This is a newly created clump “2002.” When the system encounters the next sentence, it is necessary to decide what the next step requires. A function needs to determine that a very similar clump to this exists. The output of the decision would be to use clump “2002” as a starting point (since it is similar, being off by only one role). This is accomplished by setting a new clump's class integer MAKE_FROM to the value of the old clump, clump “2002.” This would retain all the information therein, and in a new table associated with the new clump, the system would only specify the change with the replace command. This is set forth in FIG. 17, which responds to the command MAKE_FROM=“2002.” This essentially replaces “Thomas” with “Luke” as the experiencer. This would instruct the clump merely to replace the experiencer role with a new experiencer role. On the output side, the tool that looked at clumps would see the clump table in FIG. 18, wherein the roles of the verb, the actor and the experiencer were replaced or grabbed from the template. As such, these are not actually stored in the new clump, but just appear to be in the clump. It is noted that the command line for the “Luke” word as the experiencer has the information, ROLE_REPLACE, associated therewith.
  • In the example of a non-matching clump, consider the phrase “Skip bit the rabbit,” wherein only a single match exists, and that is on the verb. The other two roles do not match. For such non-matching clumps, where there are more differences than matches, a new clump is created for the proposed clump.
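  • The matching procedure and the MAKE_FROM/ROLE_REPLACE mechanism of FIGS. 16-18 can be sketched as follows. The one-role difference threshold and the function name are assumptions for illustration; only the role values, the example sentences and the MAKE_FROM idea come from the text above.

```python
# Sketch of clump reuse: an exact match returns the existing clump ID, a near match
# is stored as a MAKE_FROM template plus ROLE_REPLACE-style differences, and anything
# else becomes a brand-new clump.

def store_clump(proposed, existing_clumps, max_differences=1):
    for cid, clump in existing_clumps.items():
        roles = clump["roles"]
        if roles == proposed:
            return cid                                       # exact match: reuse as-is
        diffs = {r: v for r, v in proposed.items() if roles.get(r) != v}
        if set(roles) == set(proposed) and len(diffs) <= max_differences:
            new_cid = max(existing_clumps) + 1               # near match: template + replace
            existing_clumps[new_cid] = {"MAKE_FROM": cid, "replace": diffs,
                                        "roles": proposed}
            return new_cid
    new_cid = max(existing_clumps, default=2000) + 1         # no match: brand-new clump
    existing_clumps[new_cid] = {"roles": proposed}
    return new_cid

clumps = {2002: {"roles": {"verb": "bit", "actor": "the rabbit",
                           "experiencer": "Thomas"}}}
# "The rabbit bit Luke" differs by one role -> stored as MAKE_FROM 2002 with a replace.
print(store_clump({"verb": "bit", "actor": "the rabbit", "experiencer": "Luke"}, clumps))
# "Skip bit the rabbit" differs by two roles -> stored as an entirely new clump.
print(store_clump({"verb": "bit", "actor": "Skip", "experiencer": "the rabbit"}, clumps))
```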
  • It will be appreciated by those skilled in the art having the benefit of this disclosure that this method and apparatus for defining an artificial brain via a plurality of concept nodes connected together through predetermined relationships provides an artificial brain that arranges a plurality of concepts in individual nodes of a nodal array for the purpose of allowing them to have interconnecting relationships that can be used to define a concept flow. It should be understood that the drawings and detailed description herein are to be regarded in an illustrative rather than a restrictive manner, and are not intended to be limiting to the particular forms and examples disclosed. On the contrary, included are any further modifications, changes, rearrangements, substitutions, alternatives, design choices, and embodiments apparent to those of ordinary skill in the art, without departing from the spirit and scope hereof, as defined by the following claims. Thus, it is intended that the following claims be interpreted to embrace all such further modifications, changes, rearrangements, substitutions, alternatives, design choices, and embodiments.
  • Appendix A1 Non-Weighted Relns
  • Non-weighted relns are listed below. The entire 8-bit Cmd field is used in the enumeration. The enumeration value itself is not given because it changes from time to time. It should be understood that these are just representative of the type of information that can be stored in a relational neuron, are not meant to be exhaustive, and that any type of relationship can be so crafted.
  • Each entry below gives the reln, its Cmd code usage, and the usage of the 24 LSBs.
      • R_ASSOC (Association): NID is a neuron associated with this one. For example, the mud neuron may have an R_ASSOC pointing to earth, and earth has an identical one pointing back to mud. This reln is fully symmetric. It acts as an alternative to actively-firing neurons. See R_SPLIT for further information and usage.
      • R_BLOCK (Gamut or other list follows as a block of relns): This reln indicates that a block of data will follow that is to be processed or ignored as a whole. (This replaces the former R_GAMUT reln.) Bits 0 . . . 7 are the number of elements; bits 8 . . . 15 are the block type. For example, the irregular verbs have a present tense, past tense and past participle tense, and the three are laid out as 3 elements of a block of type R_IRR. Each has an R_IRR reln pointing back to the present-tense form.
      • R_CHILD (Child): NID of my child (class). This sets the NID as a child concept to the present (parent) neuron. For example, if solar~system is the parent neuron, earth would be the NID for an R_CHILD.
      • R_CLUMP (Action, from noun): CID links to a clump neuron. For “The cow jumped over the moon,” the cow neuron (the noun actor) would contain an R_CLUMP pointing off to a clump that describes the action. This differs from R_VCLUMP in that an identical clump CID would be pointed to by the jump (verb) neuron to the same clump. That allows access to the information either from the actor or the action side.
      • R_CPLX (Complex): Installed in the parent neuron, this points to a derived child neuron ‘complex’. It may be for the purposes of adding an adjective (“blue dog” or “seat belt”), or for creating an instance from a parent class. The NID is the complex derived from this (noun), and the complex has a back-reln of R_CPLX_OF pointing back to the parent.
      • R_CPLX_OF (Complex-of): The first reln of a complex neuron, this is the back-reln to the (noun) parent of the complex. NID is the noun this complex is derived from.
      • R_EMOT (Attached Emotion): Used to attach emotional content to a neuron (e.g., to give ‘warm fuzzies’ or concern), this reln directly indicates the emotion to be evoked. The 24 LSBs are composed of a 6-bit Firing level, a 6-bit Count, and then a 12-bit Neuron_Id to a special neuron (of emotion). The Count is used to . . . (see Luke)
      • R_IDENT_INST (Link to an identity instance): The “Jack” neuron has an R_IDENT_INST to each instance of Jack. Each such instance of a normal neuron will have an R_IDENTITY with an Iid pointing off to an Identity neuron.
      • R_IDENTITY (Link to an identity): This reln links the current neuron to a specific identity neuron. The 24 LSBs are an IID pointing to the identity neuron. (The identity neuron is separate from the neuron for a person around which his character and experiences are built. The IID contains certain things specific to all identities, such as name, birthplace, birth-date, and so on.)
      • R_INST (Instance): This reln defines an instance of a class, e.g., Fido versus dog. It can be used in any class or concept neuron to point out a specific occurrence of that class or concept. It points to a new neuron that is built up to describe (my) Fido instance and its characteristics. In that instance neuron is an R_INST_OF back-reln that points back to the class or concept neuron. The Nid in the 24 LSBs points to the new instance neuron.
      • R_INST_OF (Back-reln for Instance): This is the back-reln to an R_INST. Found in the instance neuron, its NID points back to the parent class or concept neuron. See R_INST for more details.
      • R_MADEOF (Made-of): Cakes are made of flour, eggs, water, milk and several other ingredients. This reln may be used inside a noun neuron to indicate what that noun is made up of. In this cake example, there is one R_MADEOF for each ingredient, with the NID in the 24 LSBs pointing to the constituent neuron. Each of the constituent ingredients in turn has a back-reln of R_PARTOF pointing back to the ‘cake’ (noun) neuron.
      • R_NOT (Negation): Target (noun) is an element of this class, Compl the cdx.
      • R_OP (Object of prep): Specifies the object of the preposition (paired with R_PREP). Prep phrases are attached to a noun only for ‘SUCO’ (subject-complement) cases, in which there is no time tense associated with the connection. Example: an instance of a car, “the car in the shop”. All other prep phrases are stored using clump neurons. Whenever an R_PREP reln is found, it will be immediately followed by an associated R_OP.
      • R_OP_OF (Back-reln of OP): This is a back-reln for an object-of-the-preposition and points back to the noun that the prep phrase is attached to. See R_OP for details.
      • R_PARTOF (Back-reln converse of MADEOF): An object is made up of other items, and will have R_MADEOF relns pointing to the items that make it up. Conversely, the parts that make up the item each have an R_PARTOF back-reln that indicates the class of object of which it is a part. For example, if a truck is made up of an engine, a chassis and a drive train, truck will have an R_MADEOF pointing to each of these three items. Chassis will in turn have an R_PARTOF pointing back to truck.
      • R_PROP (Phys Property list): Any noun may have one or more property lists that describe some physical aspect of an object. These may include color, texture, size and the like and are defined in the chapter on Noun Properties. The variable-length lists are recorded in an R_BLOCK list, with the length given in the 8 LSBs, and with the R_PROP enumeration given in bits 8 . . . 15 of the R_BLOCK. R_PROP never stands alone as a reln.
  • Appendix A2
  • The below table defines the weighted opcodes:
  • Each entry below gives the reln, its Cmd code usage, and the usage of the 24 LSBs.
      • R_PARENT (Amount of parent connection): Implies the degree of influence by a parent concept, specifically for verbs. The NID is the parent neuron.
      • R_INH (Inhibit neuron from firing): When the present neuron is firing, inhibit the firing of the neuron whose NID is given, by the connection weight. Slight = 10%, suppression = 100%.
      • R_CDX (Percent of target firing): This is the means of connecting an adjective to a noun. “Rather red” may imply a red weighted at 30%. The NID points to the adjective.
  • Appendix B Universal Roles
  • These roles may be used in any sentence, no matter the verb type.
  • ROLE_TIME (tim)
  • This role includes all time- or date-related phrases, including concurrence.
  • Time may be a prepositional phrase, an adverb phrase, or an appropriate proform.
  • Example: “The specimen is leaving the petri dish now.”
  • ROLE_DURATION (dur)
  • This role includes all durations, such as “The heated discussion lasted for three hours,” and “I've eaten nothing but cookies and coke for a year.”
  • Duration may be a prepositional phrase headed by “for” followed by a period of time, a prepositional phrase headed by “during” followed by a noun which denotes something happening over a period of time, or a complementizer phrase connected with ‘while’ or ‘as’.
  • Examples: “I was talking to the spacemen during the launch.” “Millicent Cow kissed the canary while the dogs looked on.”
  • ROLE_REASON (rsn)
  • The stated purpose for the action, for example “Katrin knew he wanted her to read that book, so she read it for him,” or “Bjorn sold his soul for $20.”
  • Reason may be a prepositional phrase beginning with because, a complementizer phrase beginning with the complementizer so, or in order that.
  • ROLE_MANNER (man)
  • An adverb-like phrase which describes how the verb was performed.
  • Example: “She died with dignity.”
  • Manner may be a prepositional phrase or an adverb phrase.
  • ROLE_ACCOMPANIMENT (acc)
  • This semantic role denotes the entities which accompanied the subject of the sentence. For example, “I went to the software store with Anne-Marie.” or “Go with God.”
  • The subject of the sentence need not be the Agent, as in “She was attacked with her friend Stacy.”
  • Accompaniment may be a prepositional phrase beginning with alongside or with, or it may be the adverb phrase together.
  • ROLE_INSTRUMENT (ins)
  • The Agent is the initiator of the action; the Instrument is the thing that the Agent used.
  • Example: “I brutally murdered him with a spatula.”
  • Note that an Agent need not be in the sentence for Instrument to be present, for example, in “He was killed by the sword,” the one doing the killing (the Agent) is not there, but the sword (the Instrument) is.
  • Instrument may be a noun phrase or gerund phrase.
  • Regular Roles
  • These roles must be assigned by the verb; they may not appear in just any sentence.
  • ROLE_ACTOR (atr)
  • The person who does the action, as in “The redcoats are coming!”
  • Actor may be a noun phrase, gerund phrase, or complementizer phrase.
  • ROLE_EXPERIENCER (exp)
  • Experiencer is the noun which undergoes change or feeling, as in “Jonny feels very sad today,” or “He attacked me without warning!” Note: All state verbs must assign an exp in active cps. The verb “to become” assigns an experiencer based on the following line of reasoning: The verb “to change,” as in “Rudolph changed Nancy into a nicer person,” assigns an Actor (Rudolph) and an Experiencer (Nancy). In “to become,” as in “I became nicer,” the Actor is changing himself to be nicer. In our model, we have called him the Experiencer, though this is arbitrary and he just as easily could be the Actor.
  • Experiencer may be a noun phrase, gerund phrase, or complementizer phrase.
  • ROLE_THEME (thm)
  • Similar to Topic. As in, “I joined you in your quest for truth,” or “Quit involving me in your problems,” “Dating involves lots of money.”
  • Themes may be either abstract nouns or gerunds, or complementizer phrases.
  • ROLE_EFFECT (eff)
  • The outcome of an action. An Effect may only be assigned if the Effect noun came into existence because of the verb. That is, you must be able to replace the original verb with “effect” (not “affect”), and the sentence must hold true.
  • For example, in the sentence “She baked cookies.”, it is saying that she made cookies—she caused the cookies to exist by the act of baking. Thus, “cookies” is the Effect.
  • On the other hand, in the sentence “She baked cookie dough.”, it is saying that she put the cookie dough through a process of baking, but not that she made it cookie dough by baking it.
  • Therefore, “cookie dough” would not be considered Effect, but Experiencer.
  • Effect does not require an Actor to be assigned. The sentence “I died a painful death.” is an example of this. “I”, in this case, is an Experiencer, not an agent.
  • Effect may be a noun phrase or an adjective due to a state or state verb: in “I made her angry,” the phrase “her angry” is the Effect.
  • Specially-Assigned Roles
  • These roles may only be assigned by certain kinds of verbs.
  • ROLE_STATE (sta)
  • This semantic role is for the express use of state verbs, such as “to be,” as in “I am happy,” and also for sensing verbs, such as “look,” as in “You look terrible.” Note that these are not noun phrases, but adjective phrases. This is a very rare phenomenon and very restricted—adjectives are only included as part of the predicate argument structure here because they are required by the verb. Other verbs that assign to adjective phrases are “act,” as in “You're acting strange today,” “seem” as in “You seem quite nice,” and “stay” as in “I stay warm most of the time.” State may be an adjective phrase only.
  • ROLE_IDENTITY (idt)
  • Identity was created for, but not limited to, being verbs, as in “I am Hannah”. “Hannah” receives the role of the identity, because it refers to the same entity as does the subject. Any noun phrase that is not the subject but is its coreferent is assigned the role of identity. Identity burns an implied reln from the exp.
  • Identity may be a noun phrase, gerund phrase, or complementizer phrase.
  • ROLE_MEASURE (msr)
  • All measurement verbs assign this role, as in “The pterodactyl's wingspan measures forty feet.” Measure may be a noun phrase or gerund phrase. This is the only role which can be an NP but cannot function as the Object of a sentence.

Claims (4)

1. A method for parsing the flow of natural human language to convert a flow of machine recognizable language into a conceptual flow, comprising the steps of:
recognizing a lexical structure;
determining a basic semantic grouping for the language flow in the lexical structure;
determining the basic semantic grouping that denotes the main action, occurrence or state of being for the language flow;
determining the responsibility of the main action, occurrence or state of being for the language flow within the lexical structure;
semantically parsing the lexical structure; and
resolving any ambiguities in the responsibilities in a recursive manner by applying a predetermined set of rules thereto.
2. The method of claim 1, wherein the basic semantic groupings are words.
3. The method of claim 1, wherein the lexical structure comprises a flow of sentences.
4. The method of claim 3, wherein the main action, occurrence or state of being for the language flow comprises the main verb.
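To make the sequence of steps recited in claim 1 concrete, the following is a minimal, hypothetical sketch of the claimed flow. The whitespace tokenizer, the toy verb list, and the helper names (recognize_lexical_structure, find_main_verb, semantic_parse, resolve_ambiguities) are illustrative assumptions and do not reflect the rule sets or data structures defined elsewhere in this specification.

from typing import Callable, Dict, List

TOY_VERBS = {"feels", "baked", "measures", "is", "became"}


def recognize_lexical_structure(text: str) -> List[List[str]]:
    """Split the flow of machine-recognizable language into sentences of words,
    the lexical structure and basic semantic groupings of claims 2 and 3."""
    sentences = [s.strip() for s in text.split(".") if s.strip()]
    return [s.split() for s in sentences]


def find_main_verb(words: List[str]) -> str:
    """Pick the grouping that denotes the main action, occurrence or state of
    being (the main verb, per claim 4)."""
    for w in words:
        if w.lower() in TOY_VERBS:
            return w
    return ""


def semantic_parse(words: List[str], main_verb: str) -> Dict[str, str]:
    """Rough stand-in for determining responsibility and semantically parsing:
    words before the main verb are treated as the grouping responsible for the
    action, the remainder as its complement."""
    if main_verb and main_verb in words:
        i = words.index(main_verb)
        return {"main_verb": main_verb,
                "responsible": " ".join(words[:i]),
                "complement": " ".join(words[i + 1:])}
    return {"main_verb": "", "responsible": "", "complement": " ".join(words)}


def resolve_ambiguities(parse: Dict[str, str],
                        rules: List[Callable[[Dict[str, str]], Dict[str, str]]],
                        depth: int = 0) -> Dict[str, str]:
    """Recursively apply a predetermined set of rules until none remain
    (or a depth limit is reached)."""
    if not rules or depth > 10:
        return parse
    return resolve_ambiguities(rules[0](parse), rules[1:], depth + 1)


def parse_flow(text: str) -> List[Dict[str, str]]:
    """Convert a flow of text into a conceptual flow: one parse per sentence."""
    flow = []
    for words in recognize_lexical_structure(text):
        verb = find_main_verb(words)
        flow.append(resolve_ambiguities(semantic_parse(words, verb), rules=[]))
    return flow


if __name__ == "__main__":
    print(parse_flow("Jonny feels very sad today. She baked cookies."))

In this sketch, sentences play the part of the lexical structure, words the basic semantic groupings, and the main verb the grouping denoting the main action, in the manner of claims 2 through 4.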
US14/273,143 1996-05-06 2014-05-08 Natural language parsing method to provide conceptual flow Abandoned US20140303963A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US14/273,143 US20140303963A1 (en) 1996-05-06 2014-05-08 Natural language parsing method to provide conceptual flow

Applications Claiming Priority (18)

Application Number Priority Date Filing Date Title
US1691896P 1996-05-06 1996-05-06
US53449204P 2004-01-06 2004-01-06
US53464104P 2004-01-06 2004-01-06
US53465904P 2004-01-06 2004-01-06
US3045205A 2005-01-06 2005-01-06
US11/154,313 US7089218B1 (en) 2004-01-06 2005-06-16 Method for inclusion of psychological temperament in an electronic emulation of the human brain
US76444206P 2006-02-02 2006-02-02
US79016606P 2006-04-07 2006-04-07
US81129906P 2006-06-05 2006-06-05
US11/425,688 US7849034B2 (en) 2004-01-06 2006-06-21 Method of emulating human cognition in a brain model containing a plurality of electronically represented neurons
US11/670,959 US20070156625A1 (en) 2004-01-06 2007-02-02 Method for movie animation
US11/697,721 US8001067B2 (en) 2004-01-06 2007-04-07 Method for substituting an electronic emulation of the human brain into an application to replace a human
US91109907P 2007-04-11 2007-04-11
US11/758,667 US7925492B2 (en) 2004-01-06 2007-06-05 Method for determining relationships through use of an ordered list between processing nodes in an emulated human brain
US95424707P 2007-08-06 2007-08-06
US1691807P 2007-12-27 2007-12-27
US12/101,904 US8725493B2 (en) 2004-01-06 2008-04-11 Natural language parsing method to provide conceptual flow
US14/273,143 US20140303963A1 (en) 1996-05-06 2014-05-08 Natural language parsing method to provide conceptual flow

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US12/101,904 Continuation US8725493B2 (en) 1996-05-06 2008-04-11 Natural language parsing method to provide conceptual flow

Publications (1)

Publication Number Publication Date
US20140303963A1 true US20140303963A1 (en) 2014-10-09

Family

ID=39763540

Family Applications (2)

Application Number Title Priority Date Filing Date
US12/101,904 Expired - Fee Related US8725493B2 (en) 1996-05-06 2008-04-11 Natural language parsing method to provide conceptual flow
US14/273,143 Abandoned US20140303963A1 (en) 1996-05-06 2014-05-08 Natural language parsing method to provide conceptual flow

Family Applications Before (1)

Application Number Title Priority Date Filing Date
US12/101,904 Expired - Fee Related US8725493B2 (en) 1996-05-06 2008-04-11 Natural language parsing method to provide conceptual flow

Country Status (1)

Country Link
US (2) US8725493B2 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9009029B1 (en) * 2012-11-01 2015-04-14 Digital Reasoning Systems, Inc. Semantic hashing in entity resolution
US20170330104A1 (en) * 2016-05-13 2017-11-16 Cognitive Scale, Inc. Ingesting Information into a Universal Cognitive Graph
US11227127B2 (en) 2019-09-24 2022-01-18 International Business Machines Corporation Natural language artificial intelligence topology mapping for chatbot communication flow

Families Citing this family (118)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7925492B2 (en) 2004-01-06 2011-04-12 Neuric Technologies, L.L.C. Method for determining relationships through use of an ordered list between processing nodes in an emulated human brain
US7089218B1 (en) 2004-01-06 2006-08-08 Neuric Technologies, Llc Method for inclusion of psychological temperament in an electronic emulation of the human brain
US8473449B2 (en) 2005-01-06 2013-06-25 Neuric Technologies, Llc Process of dialogue and discussion
US8874477B2 (en) 2005-10-04 2014-10-28 Steven Mark Hoffberg Multifactorial optimization system and method
US8688749B1 (en) 2011-03-31 2014-04-01 Palantir Technologies, Inc. Cross-ontology multi-master replication
US8515912B2 (en) 2010-07-15 2013-08-20 Palantir Technologies, Inc. Sharing and deconflicting data changes in a multimaster database system
US7962495B2 (en) 2006-11-20 2011-06-14 Palantir Technologies, Inc. Creating data in a data store using a dynamic ontology
US8554719B2 (en) 2007-10-18 2013-10-08 Palantir Technologies, Inc. Resolving database entity information
US10747952B2 (en) 2008-09-15 2020-08-18 Palantir Technologies, Inc. Automatic creation and server push of multiple distinct drafts
US8463720B1 (en) * 2009-03-27 2013-06-11 Neuric Technologies, Llc Method and apparatus for defining an artificial brain via a plurality of concept nodes defined by frame semantics
US8909566B2 (en) 2009-06-23 2014-12-09 Oracle International Corporation Method, a computer program and apparatus for analyzing symbols in a computer
US20110046941A1 (en) * 2009-08-18 2011-02-24 Manuel-Devados Johnson Smith Johnson Advanced Natural Language Translation System
US8161063B2 (en) * 2009-09-11 2012-04-17 Eastman Kodak Company Multimedia object retrieval from natural language queries
US8943094B2 (en) 2009-09-22 2015-01-27 Next It Corporation Apparatus, system, and method for natural language processing
US8364642B1 (en) 2010-07-07 2013-01-29 Palantir Technologies, Inc. Managing disconnected investigations
US9244984B2 (en) 2011-03-31 2016-01-26 Microsoft Technology Licensing, Llc Location based conversational understanding
US10642934B2 (en) 2011-03-31 2020-05-05 Microsoft Technology Licensing, Llc Augmented conversational understanding architecture
US9298287B2 (en) 2011-03-31 2016-03-29 Microsoft Technology Licensing, Llc Combined activation for natural user interface systems
US9858343B2 (en) 2011-03-31 2018-01-02 Microsoft Technology Licensing Llc Personalization of queries, conversations, and searches
US9760566B2 (en) 2011-03-31 2017-09-12 Microsoft Technology Licensing, Llc Augmented conversational understanding agent to identify conversation context between two humans and taking an agent action thereof
US9842168B2 (en) 2011-03-31 2017-12-12 Microsoft Technology Licensing, Llc Task driven user intents
US9064006B2 (en) 2012-08-23 2015-06-23 Microsoft Technology Licensing, Llc Translating natural language utterances to keyword search queries
US9454962B2 (en) * 2011-05-12 2016-09-27 Microsoft Technology Licensing, Llc Sentence simplification for spoken language understanding
US8732574B2 (en) 2011-08-25 2014-05-20 Palantir Technologies, Inc. System and method for parameterizing documents for automatic workflow generation
US9495352B1 (en) * 2011-09-24 2016-11-15 Athena Ann Smyros Natural language determiner to identify functions of a device equal to a user manual
US8782004B2 (en) 2012-01-23 2014-07-15 Palantir Technologies, Inc. Cross-ACL multi-master replication
US9396179B2 (en) * 2012-08-30 2016-07-19 Xerox Corporation Methods and systems for acquiring user related information using natural language processing techniques
US9348677B2 (en) 2012-10-22 2016-05-24 Palantir Technologies Inc. System and method for batch evaluation programs
US9081975B2 (en) 2012-10-22 2015-07-14 Palantir Technologies, Inc. Sharing information between nexuses that use different classification schemes for information access control
US9501761B2 (en) 2012-11-05 2016-11-22 Palantir Technologies, Inc. System and method for sharing investigation results
US9594745B2 (en) 2013-03-01 2017-03-14 The Software Shop, Inc. Systems and methods for improving the efficiency of syntactic and semantic analysis in automated processes for natural language understanding using general composition
US10140664B2 (en) 2013-03-14 2018-11-27 Palantir Technologies Inc. Resolving similar entities from a transaction database
US8930897B2 (en) 2013-03-15 2015-01-06 Palantir Technologies Inc. Data integration tool
US8855999B1 (en) 2013-03-15 2014-10-07 Palantir Technologies Inc. Method and system for generating a parser and parsing complex data
US9740369B2 (en) 2013-03-15 2017-08-22 Palantir Technologies Inc. Systems and methods for providing a tagging interface for external content
US8903717B2 (en) 2013-03-15 2014-12-02 Palantir Technologies Inc. Method and system for generating a parser and parsing complex data
US8868486B2 (en) 2013-03-15 2014-10-21 Palantir Technologies Inc. Time-sensitive cube
US9898167B2 (en) 2013-03-15 2018-02-20 Palantir Technologies Inc. Systems and methods for providing a tagging interface for external content
US8924388B2 (en) 2013-03-15 2014-12-30 Palantir Technologies Inc. Computer-implemented systems and methods for comparing and associating objects
US8909656B2 (en) 2013-03-15 2014-12-09 Palantir Technologies Inc. Filter chains with associated multipath views for exploring large data sets
US8886601B1 (en) 2013-06-20 2014-11-11 Palantir Technologies, Inc. System and method for incrementally replicating investigative analysis data
US8601326B1 (en) 2013-07-05 2013-12-03 Palantir Technologies, Inc. Data quality monitors
US9223773B2 (en) 2013-08-08 2015-12-29 Palantir Technologies Inc. Template system for custom document generation
US8938686B1 (en) 2013-10-03 2015-01-20 Palantir Technologies Inc. Systems and methods for analyzing performance of an entity
US9105000B1 (en) 2013-12-10 2015-08-11 Palantir Technologies Inc. Aggregating data from a plurality of data sources
US10579647B1 (en) 2013-12-16 2020-03-03 Palantir Technologies Inc. Methods and systems for analyzing entity performance
US9009827B1 (en) 2014-02-20 2015-04-14 Palantir Technologies Inc. Security sharing system
US8935201B1 (en) 2014-03-18 2015-01-13 Palantir Technologies Inc. Determining and extracting changed data from a data source
US9836580B2 (en) 2014-03-21 2017-12-05 Palantir Technologies Inc. Provider portal
US9984067B2 (en) * 2014-04-18 2018-05-29 Thomas A. Visel Automated comprehension of natural language via constraint-based processing
US10572496B1 (en) 2014-07-03 2020-02-25 Palantir Technologies Inc. Distributed workflow system and database with access controls for city resiliency
WO2016007923A1 (en) 2014-07-11 2016-01-14 Craymer Loring G Iii Method and system for linear generalized ll recognition and context-aware parsing
US9953171B2 (en) * 2014-09-22 2018-04-24 Infosys Limited System and method for tokenization of data for privacy
US9229952B1 (en) 2014-11-05 2016-01-05 Palantir Technologies, Inc. History preserving data pipeline system and method
US9792095B2 (en) 2014-11-25 2017-10-17 Symbol Technologies, Llc Apparatus and method for converting a procedure manual to an automated program
US9483546B2 (en) 2014-12-15 2016-11-01 Palantir Technologies Inc. System and method for associating related records to common entities across multiple lists
US11302426B1 (en) 2015-01-02 2022-04-12 Palantir Technologies Inc. Unified data interface and system
US10803106B1 (en) 2015-02-24 2020-10-13 Palantir Technologies Inc. System with methodology for dynamic modular ontology
US9727560B2 (en) 2015-02-25 2017-08-08 Palantir Technologies Inc. Systems and methods for organizing and identifying documents via hierarchies and dimensions of tags
US10103953B1 (en) 2015-05-12 2018-10-16 Palantir Technologies Inc. Methods and systems for analyzing entity performance
US10628834B1 (en) 2015-06-16 2020-04-21 Palantir Technologies Inc. Fraud lead detection system for efficiently processing database-stored data and automatically generating natural language explanatory information of system results for display in interactive user interfaces
US9418337B1 (en) 2015-07-21 2016-08-16 Palantir Technologies Inc. Systems and models for data analytics
US9392008B1 (en) 2015-07-23 2016-07-12 Palantir Technologies Inc. Systems and methods for identifying information related to payment card breaches
US9996595B2 (en) 2015-08-03 2018-06-12 Palantir Technologies, Inc. Providing full data provenance visualization for versioned datasets
US10127289B2 (en) 2015-08-19 2018-11-13 Palantir Technologies Inc. Systems and methods for automatic clustering and canonical designation of related data in various data structures
US10853378B1 (en) 2015-08-25 2020-12-01 Palantir Technologies Inc. Electronic note management via a connected entity graph
US9984428B2 (en) 2015-09-04 2018-05-29 Palantir Technologies Inc. Systems and methods for structuring data from unstructured electronic data files
US9576015B1 (en) 2015-09-09 2017-02-21 Palantir Technologies, Inc. Domain-specific language for dataset transformations
US9514414B1 (en) 2015-12-11 2016-12-06 Palantir Technologies Inc. Systems and methods for identifying and categorizing electronic documents through machine learning
US9760556B1 (en) 2015-12-11 2017-09-12 Palantir Technologies Inc. Systems and methods for annotating and linking electronic documents
US10073834B2 (en) * 2016-02-09 2018-09-11 International Business Machines Corporation Systems and methods for language feature generation over multi-layered word representation
US10248722B2 (en) 2016-02-22 2019-04-02 Palantir Technologies Inc. Multi-language support for dynamic ontology
US10698938B2 (en) 2016-03-18 2020-06-30 Palantir Technologies Inc. Systems and methods for organizing and identifying documents via hierarchies and dimensions of tags
US9846840B1 (en) * 2016-05-25 2017-12-19 Adobe Systems Incorporated Semantic class localization in images
US10007674B2 (en) 2016-06-13 2018-06-26 Palantir Technologies Inc. Data revision control in large-scale data analytic systems
US11106692B1 (en) 2016-08-04 2021-08-31 Palantir Technologies Inc. Data record resolution and correlation system
US10133588B1 (en) 2016-10-20 2018-11-20 Palantir Technologies Inc. Transforming instructions for collaborative updates
US10102229B2 (en) 2016-11-09 2018-10-16 Palantir Technologies Inc. Validating data integrations using a secondary data store
US9946777B1 (en) 2016-12-19 2018-04-17 Palantir Technologies Inc. Systems and methods for facilitating data transformation
US9922108B1 (en) 2017-01-05 2018-03-20 Palantir Technologies Inc. Systems and methods for facilitating data transformation
US11074277B1 (en) 2017-05-01 2021-07-27 Palantir Technologies Inc. Secure resolution of canonical entities
US10839154B2 (en) * 2017-05-10 2020-11-17 Oracle International Corporation Enabling chatbots by detecting and supporting affective argumentation
US10817670B2 (en) * 2017-05-10 2020-10-27 Oracle International Corporation Enabling chatbots by validating argumentation
US11586827B2 (en) * 2017-05-10 2023-02-21 Oracle International Corporation Generating desired discourse structure from an arbitrary text
US11386274B2 (en) * 2017-05-10 2022-07-12 Oracle International Corporation Using communicative discourse trees to detect distributed incompetence
US10679011B2 (en) * 2017-05-10 2020-06-09 Oracle International Corporation Enabling chatbots by detecting and supporting argumentation
US20220284194A1 (en) * 2017-05-10 2022-09-08 Oracle International Corporation Using communicative discourse trees to detect distributed incompetence
US11615145B2 (en) * 2017-05-10 2023-03-28 Oracle International Corporation Converting a document into a chatbot-accessible form via the use of communicative discourse trees
US10599885B2 (en) * 2017-05-10 2020-03-24 Oracle International Corporation Utilizing discourse structure of noisy user-generated content for chatbot learning
US11373632B2 (en) * 2017-05-10 2022-06-28 Oracle International Corporation Using communicative discourse trees to create a virtual persuasive dialogue
CN110612525B (en) * 2017-05-10 2024-03-19 Oracle International Corporation Enabling rhetorical analysis via the use of communicative discourse trees
US10956406B2 (en) 2017-06-12 2021-03-23 Palantir Technologies Inc. Propagated deletion of database records and derived data
US10839161B2 (en) 2017-06-15 2020-11-17 Oracle International Corporation Tree kernel learning for text classification into classes of intent
US11100144B2 (en) 2017-06-15 2021-08-24 Oracle International Corporation Data loss prevention system for cloud security based on document discourse analysis
US10691729B2 (en) 2017-07-07 2020-06-23 Palantir Technologies Inc. Systems and methods for providing an object platform for a relational database
US11182412B2 (en) 2017-09-27 2021-11-23 Oracle International Corporation Search indexing using discourse trees
EP3688609A1 (en) 2017-09-28 2020-08-05 Oracle International Corporation Determining cross-document rhetorical relationships based on parsing and identification of named entities
US10796099B2 (en) 2017-09-28 2020-10-06 Oracle International Corporation Enabling autonomous agents to discriminate between questions and requests
US10956508B2 (en) 2017-11-10 2021-03-23 Palantir Technologies Inc. Systems and methods for creating and managing a data integration workspace containing automatically updated data models
US10235533B1 (en) 2017-12-01 2019-03-19 Palantir Technologies Inc. Multi-user access controls in electronic simultaneously editable document editor
US11061874B1 (en) 2017-12-14 2021-07-13 Palantir Technologies Inc. Systems and methods for resolving entity data across various data structures
US10838987B1 (en) 2017-12-20 2020-11-17 Palantir Technologies Inc. Adaptive and transparent entity screening
US10949623B2 (en) 2018-01-30 2021-03-16 Oracle International Corporation Using communicative discourse trees to detect a request for an explanation
US11537645B2 (en) * 2018-01-30 2022-12-27 Oracle International Corporation Building dialogue structure by using communicative discourse trees
US10754822B1 (en) 2018-04-18 2020-08-25 Palantir Technologies Inc. Systems and methods for ontology migration
WO2019217722A1 (en) 2018-05-09 2019-11-14 Oracle International Corporation Constructing imaginary discourse trees to improve answering convergent questions
US11461355B1 (en) 2018-05-15 2022-10-04 Palantir Technologies Inc. Ontological mapping of data
US11455494B2 (en) 2018-05-30 2022-09-27 Oracle International Corporation Automated building of expanded datasets for training of autonomous agents
US11061542B1 (en) 2018-06-01 2021-07-13 Palantir Technologies Inc. Systems and methods for determining and displaying optimal associations of data items
US10795909B1 (en) 2018-06-14 2020-10-06 Palantir Technologies Inc. Minimized and collapsed resource dependency path
US11645459B2 (en) 2018-07-02 2023-05-09 Oracle International Corporation Social autonomous agent implementation using lattice queries and relevancy detection
US11562135B2 (en) * 2018-10-16 2023-01-24 Oracle International Corporation Constructing conclusive answers for autonomous agents
US11232264B2 (en) * 2018-10-19 2022-01-25 Verint Americas Inc. Natural language processing with non-ontological hierarchy models
US11321536B2 (en) 2019-02-13 2022-05-03 Oracle International Corporation Chatbot conducting a virtual social dialogue
US11210472B2 (en) * 2019-05-08 2021-12-28 Tata Consultancy Services Limited Automated extraction of message sequence chart from textual description
CN110472132A (en) * 2019-08-01 2019-11-19 Tencent Technology (Shenzhen) Co., Ltd. Method, apparatus and medium for acquiring security-related public opinion information
US11449682B2 (en) 2019-08-29 2022-09-20 Oracle International Corporation Adjusting chatbot conversation to user personality and mood
US11775772B2 (en) 2019-12-05 2023-10-03 Oracle International Corporation Chatbot providing a defeating reply

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6108619A (en) * 1998-07-02 2000-08-22 Novell, Inc. Method and apparatus for semantic characterization of general content streams and repositories
US6405199B1 (en) * 1998-10-30 2002-06-11 Novell, Inc. Method and apparatus for semantic token generation based on marked phrases in a content stream
US7152031B1 (en) * 2000-02-25 2006-12-19 Novell, Inc. Construction, manipulation, and comparison of a multi-dimensional semantic space
US7197451B1 (en) * 1998-07-02 2007-03-27 Novell, Inc. Method and mechanism for the creation, maintenance, and comparison of semantic abstracts
US7286977B1 (en) * 2000-09-05 2007-10-23 Novell, Inc. Intentional-stance characterization of a general content stream or repository
US7389225B1 (en) * 2000-10-18 2008-06-17 Novell, Inc. Method and mechanism for superpositioning state vectors in a semantic abstract
US7653530B2 (en) * 2000-07-13 2010-01-26 Novell, Inc. Method and mechanism for the creation, maintenance, and comparison of semantic abstracts

Family Cites Families (27)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4884972A (en) * 1986-11-26 1989-12-05 Bright Star Technology, Inc. Speech synchronized animation
US5528472A (en) * 1987-10-23 1996-06-18 Mag Instrument, Inc. Rechargeable miniature flashlight
US5418716A (en) * 1990-07-26 1995-05-23 Nec Corporation System for recognizing sentence patterns and a system for recognizing sentence patterns and grammatical cases
US5371807A (en) * 1992-03-20 1994-12-06 Digital Equipment Corporation Method and apparatus for text classification
US5406956A (en) * 1993-02-11 1995-04-18 Francis Luca Conte Method and apparatus for truth detection
US5918222A (en) * 1995-03-17 1999-06-29 Kabushiki Kaisha Toshiba Information disclosing apparatus and multi-modal information input/output system
US5721938A (en) * 1995-06-07 1998-02-24 Stuckey; Barbara K. Method and device for parsing and analyzing natural language sentences and text
US6076051A (en) * 1997-03-07 2000-06-13 Microsoft Corporation Information retrieval utilizing semantic representation of text
US6098033A (en) * 1997-07-31 2000-08-01 Microsoft Corporation Determining similarity between words
US6081774A (en) * 1997-08-22 2000-06-27 Novell, Inc. Natural language information retrieval system and method
US5960384A (en) * 1997-09-03 1999-09-28 Brash; Douglas E. Method and device for parsing natural language sentences and other sequential symbolic expressions
US6269368B1 (en) 1997-10-17 2001-07-31 Textwise Llc Information retrieval using dynamic evidence combination
EP0962873A1 (en) * 1998-06-02 1999-12-08 International Business Machines Corporation Processing of textual information and automated apprehension of information
EP1006452A2 (en) 1998-12-02 2000-06-07 Pitney Bowes Inc. A method and system for creating a rules-based document
US8095581B2 (en) * 1999-02-05 2012-01-10 Gregory A Stobbs Computer-implemented patent portfolio analysis method and apparatus
US6584464B1 (en) * 1999-03-19 2003-06-24 Ask Jeeves, Inc. Grammar template query system
US6513006B2 (en) * 1999-08-26 2003-01-28 Matsushita Electric Industrial Co., Ltd. Automatic control of household activity using speech recognition and natural language
US6415257B1 (en) * 1999-08-26 2002-07-02 Matsushita Electric Industrial Co., Ltd. System for identifying and adapting a TV-user profile by means of speech technology
US6330537B1 (en) * 1999-08-26 2001-12-11 Matsushita Electric Industrial Co., Ltd. Automatic filtering of TV contents using speech recognition and natural language
US6353810B1 (en) * 1999-08-31 2002-03-05 Accenture Llp System, method and article of manufacture for an emotion detection system improving emotion recognition
US6601026B2 (en) * 1999-09-17 2003-07-29 Discern Communications, Inc. Information retrieval by natural language querying
US6766320B1 (en) * 2000-08-24 2004-07-20 Microsoft Corporation Search engine with natural language-based robust parsing for user query and relevance feedback learning
US6950104B1 (en) * 2000-08-30 2005-09-27 Microsoft Corporation Methods and systems for animating facial features, and methods and systems for expression transformation
US7027974B1 (en) * 2000-10-27 2006-04-11 Science Applications International Corporation Ontology-based parser for natural language processing
WO2003012661A1 (en) * 2001-07-31 2003-02-13 Invention Machine Corporation Computer based summarization of natural language documents
US6826568B2 (en) * 2001-12-20 2004-11-30 Microsoft Corporation Methods and system for model matching
US7693720B2 (en) * 2002-07-15 2010-04-06 Voicebox Technologies, Inc. Mobile systems and methods for responding to natural language speech utterance

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6108619A (en) * 1998-07-02 2000-08-22 Novell, Inc. Method and apparatus for semantic characterization of general content streams and repositories
US7197451B1 (en) * 1998-07-02 2007-03-27 Novell, Inc. Method and mechanism for the creation, maintenance, and comparison of semantic abstracts
US6405199B1 (en) * 1998-10-30 2002-06-11 Novell, Inc. Method and apparatus for semantic token generation based on marked phrases in a content stream
US7152031B1 (en) * 2000-02-25 2006-12-19 Novell, Inc. Construction, manipulation, and comparison of a multi-dimensional semantic space
US7475008B2 (en) * 2000-02-25 2009-01-06 Novell, Inc. Construction, manipulation, and comparison of a multi-dimensional semantic space
US7653530B2 (en) * 2000-07-13 2010-01-26 Novell, Inc. Method and mechanism for the creation, maintenance, and comparison of semantic abstracts
US7286977B1 (en) * 2000-09-05 2007-10-23 Novell, Inc. Intentional-stance characterization of a general content stream or repository
US7562011B2 (en) * 2000-09-05 2009-07-14 Novell, Inc. Intentional-stance characterization of a general content stream or repository
US7389225B1 (en) * 2000-10-18 2008-06-17 Novell, Inc. Method and mechanism for superpositioning state vectors in a semantic abstract

Cited By (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9009029B1 (en) * 2012-11-01 2015-04-14 Digital Reasoning Systems, Inc. Semantic hashing in entity resolution
US10769535B2 (en) 2016-05-13 2020-09-08 Cognitive Scale, Inc. Ingestion pipeline for universal cognitive graph
US10796227B2 (en) 2016-05-13 2020-10-06 Cognitive Scale, Inc. Ranking of parse options using machine learning
US10565504B2 (en) 2016-05-13 2020-02-18 Cognitive Scale, Inc. Structurally defining knowledge elements within a cognitive graph
US10699196B2 (en) 2016-05-13 2020-06-30 Cognitive Scale, Inc. Lossless parsing when storing knowledge elements within a universal cognitive graph
US10706357B2 (en) * 2016-05-13 2020-07-07 Cognitive Scale, Inc. Ingesting information into a universal cognitive graph
US10706358B2 (en) 2016-05-13 2020-07-07 Cognitive Scale, Inc. Lossless parsing when storing knowledge elements within a universal cognitive graph
US10528870B2 (en) 2016-05-13 2020-01-07 Cognitive Scale, Inc. Natural language query procedure where query is ingested into a cognitive graph
US20170330104A1 (en) * 2016-05-13 2017-11-16 Cognitive Scale, Inc. Ingesting Information into a Universal Cognitive Graph
US10719766B2 (en) 2016-05-13 2020-07-21 Cognitive Scale, Inc. Universal cognitive graph architecture
US10860932B2 (en) 2016-05-13 2020-12-08 Cognitive Scale, Inc. Universal graph output via insight agent accessing the universal graph
US10860933B2 (en) 2016-05-13 2020-12-08 Cognitive Scale, Inc. Universal graph output via insight agent accessing the universal graph
US10860936B2 (en) 2016-05-13 2020-12-08 Cognitive Scale, Inc. Universal quantification of knowledge elements within a cognitive graph
US11295216B2 (en) 2016-05-13 2022-04-05 Cognitive Scale, Inc. Structurally defining knowledge elements within a cognitive graph
US11244229B2 (en) 2016-05-13 2022-02-08 Cognitive Scale, Inc. Natural language query procedure where query is ingested into a cognitive graph
US11227127B2 (en) 2019-09-24 2022-01-18 International Business Machines Corporation Natural language artificial intelligence topology mapping for chatbot communication flow

Also Published As

Publication number Publication date
US8725493B2 (en) 2014-05-13
US20080228467A1 (en) 2008-09-18

Similar Documents

Publication Publication Date Title
US8725493B2 (en) Natural language parsing method to provide conceptual flow
US8463720B1 (en) Method and apparatus for defining an artificial brain via a plurality of concept nodes defined by frame semantics
US20080243741A1 (en) Method and apparatus for defining an artificial brain via a plurality of concept nodes connected together through predetermined relationships
Schubert et al. Episodic Logic meets Little Red Riding Hood: A comprehensive, natural representation for language understanding
MacWhinney The competition model
Bowman et al. A large annotated corpus for learning natural language inference
US9064211B2 (en) Method for determining relationships through use of an ordered list between processing nodes in an emulated human brain
Jackendoff et al. Morphological schemas: Theoretical and psycholinguistic issues
US7899666B2 (en) Method and system for automatically extracting relations between concepts included in text
US5960384A (en) Method and device for parsing natural language sentences and other sequential symbolic expressions
Solan Pronominal reference: Child language and the theory of grammar
Binsted et al. Computational rules for generating punning riddles
US20100088262A1 (en) Emulated brain
Morris et al. A connectionist simulation of the empirical acquisition of grammatical relations
MacWhinney Competition and connectionism
Delmonte Computational Linguistic Text Processing: Logical Form, Semantic Interpretation, Discourse Relations and Question Answering
Schank A conceptual dependency representation for a computer-oriented semantics
Shastri et al. Semantic networks and neural nets
Scott The logos model: An historical perspective
Gomez et al. Determining prepositional attachment, prepositional meaning, verb meaning, and thematic roles
Szymanski et al. Towards avatars with artificial minds: Role of semantic memory
Garcia Rosa et al. Linguistic relations encoding in a symbolic-connectionist hybrid natural language processor
Yang Natural-Language Input for the Scone Knowledge-Base System
Chan Integrating linguistic primitives in learning context-dependent representation
Sierra-Santibáñez Propositional logic syntax acquisition

Legal Events

Date Code Title Description
STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION