WO2002077904A2 - Method and apparatus for intelligent data assimilation - Google Patents

Method and apparatus for intelligent data assimilation Download PDF

Info

Publication number
WO2002077904A2
WO2002077904A2 PCT/US2002/009528 US0209528W WO02077904A2 WO 2002077904 A2 WO2002077904 A2 WO 2002077904A2 US 0209528 W US0209528 W US 0209528W WO 02077904 A2 WO02077904 A2 WO 02077904A2
Authority
WO
WIPO (PCT)
Prior art keywords
data
request
communications link
workflow
search
Prior art date
Application number
PCT/US2002/009528
Other languages
French (fr)
Other versions
WO2002077904A3 (en
Inventor
David C. Wachtel
Original Assignee
Us Search.Com Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Us Search.Com Inc. filed Critical Us Search.Com Inc.
Priority to EP02728596A priority Critical patent/EP1435047A4/en
Priority to AU2002258640A priority patent/AU2002258640B2/en
Publication of WO2002077904A2 publication Critical patent/WO2002077904A2/en
Publication of WO2002077904A3 publication Critical patent/WO2002077904A3/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/958Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking
    • G06F16/972Access to data in other repository systems, e.g. legacy data or dynamic Web page generation
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/953Organization of data
    • Y10S707/955Object-oriented
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/99931Database or file accessing
    • Y10S707/99933Query processing, i.e. searching
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/99941Database schema or data structure
    • Y10S707/99942Manipulating data structure, e.g. compression, compaction, compilation
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/99941Database schema or data structure
    • Y10S707/99944Object-oriented database structure
    • Y10S707/99945Object-oriented database structure processing

Definitions

  • This invention relates generally to the field of servers coupled to computer networks and more specifically to servers used as search engines.
  • Web servers are software applications hosted by computer systems operably connected to the Internet. Web servers receive requests from Web clients and provide responses in the form of documents written in a document markup language such as Hyper Text Markup Language (HTML).
  • HTML Hyper Text Markup Language
  • the first Web servers operated as connections to file systems and served static HTML documents stored as files within a computer system.
  • the static HTML documents were quickly replaced by programs that accept as input parameters a client request and generate as output a document. These programs are typically written in a variety of interpreted procedural languages such as Perl and acquired procedural languages such as C.
  • the use of document generating programs also introduced new capabilities. Data could be drawn from any source and used to create a document. For example, databases could be queried and the results used within a document.
  • the document generating programs could create side effects during the execution of a document generating program. For example, a typical side effect may include creation and storage of tracking statistics for later use and analysis by a Web site's host.
  • the application server comprises an interpreter and the document generating programs are written in a proprietary programming language, hi the case where the proprietary programming language is a procedural language, the application server reads and runs the document generating programs without invoking a new process.
  • the proprietary programming language is an object oriented language such as Java
  • each document generating program is a software object known as a servlet.
  • the Web server invokes the servlets as needed and the servlets handle the remaining document generation functions.
  • a servlet once invoked includes all of the necessary logic necessary to acquire data according to a client request, format the acquired data into a document, and report back to the hosting system any statistics about the client/servlet interaction.
  • Finer and finer vertical slices through a Web site leads to an unscalable Web server architecture because all the logic for acquiring data, generating a document, and producing side effects is included within a monolithic software object thus defeating the ability to reuse or integrate a particular software object in another system or software object.
  • a method for fulfilling a data service request.
  • the method includes providing an ontology description of a data service a first logical search object operably coupled via a first communications link to a data provider.
  • the first logical search object transmits to the data provider via the communications link a search request generated by the first logical search obj ect from the data service request.
  • the first logical search object receives from the data provider via the communications link a data set in response to the search request.
  • the first logical search object generates a knowledge instance from the data set using the ontology description.
  • the method for fulfilling a data service request is adapted for a plurality of data providers including a database server, a FTP server, a Web server, a file system, and a human data provider. Additionally, the method is adapted for a communications protocol proprietary to a data provider.
  • the data service request is included in a XML document.
  • the method for fulfilling a data service request further includes providing a first workflow operably coupled to the logical search obj ect and transmitting by the first logical search object to the first workflow the knowledge instance.
  • the method for fulfilling a data service request further includes providing a second logical search object operably coupled to the first workflow, with the first workflow encapsulating the ontological relationship between the first and second logical search objects.
  • the method for fulfilling a data service request further includes providing an application server operably coupling a data client to the first workflow via a second communications link.
  • the application server receives from the data client via the second communications link a data service request message with the data service request included in the data service request message.
  • the application server transmits to the first workflow the data service request message and the first workflow transmits to the logical search object the data service request message.
  • the second communications link is adapted for communications using SMTP, JMS, HTTP, or RMI.
  • the method for fulfilling a data service request further includes providing a formatter and formatting by the formatter the data set encapsulated in the knowledge instance into a format requested by the data client.
  • a data processing system is adapted to access a data provider via a communications link.
  • the data processing system includes a processor and a memory operably coupled to the processor and having program instructions stored therein, the processor being operable to execute the program instructions.
  • the program instructions include receiving by a software object a search request message document and generating by the software object a data request for the data provider from the search request message document.
  • the software obj ect transmits to the data provider the data request via the communications link.
  • the software object receives from the data provider a data set via the communications link and generates a semantic object from the data set.
  • s data processing system adapted to access a data provider via a communications link includes a parser adaptor operably coupled to a software object.
  • a parser semantic description of the data set and a semantic object semantic description is provided for use by the parser adaptor.
  • the parser adaptor generates extracted data from the data set using the parser semantic description.
  • the parser adaptor then generates a semantic object using the extracted data according to the semantic object semantic description.
  • a data processing system adapted to access a data provider via a communications link includes a request builder operably coupled to a software object.
  • a native object is operably coupled to the request builder with the native object encapsulating implementation details of a data request for the data provider.
  • a native semantic description including ontology information describing a data structure used by the request builder to build a data request for the data provider.
  • the request builder transmits to the native object a search request and the native object generates a data request from the search request using the native semantic description.
  • FIG. 1 is a deployment diagram of an intelligent data assimilation system according to the present invention
  • FIG. 2 is a hardware architecture diagram of a general purpose computer suitable for use as an intelligent data assimilation system host;
  • FIG. 3 is an architecture diagram illustrating an embodiment of an intelligent data assimilation system according to the present invention
  • FIG.4 is a diagram of the levels within one embodiment of an intelligent data assimilation system ontology according to the present invention
  • FIG. 5 is a diagram depicting an exemplary embodiment of an ontological relationship between semantic constructs and their associated logical search objects in an ontology according to the present invention
  • FIG.6 is a diagram depicting an exemplary embodiment of how service classes are created using the ontological relationships of logical search objects according to the present invention
  • FIGS. 7a and 7b are diagrams depicting how semantic objects are described within the metadata store;
  • FIG. 8 is a diagram depicting the operations of an embodiment of an exemplary workflow according to the present invention.
  • FIG. 9 is a collaboration diagram depicting the interactions within an embodiment of a workflow process according to the present invention.
  • FIG. 10 depicts an exemplary embodiment of a logical search object used within a workflow according to the present invention
  • FIG. 11 is a diagram depicting the layered architecture of an embodiment of a generic logical search object according to the present invention and how this layered architecture is used to generate logical search objects with various connectivity capabilities;
  • FIG. 12 is a sequence diagram illustrating the sequence of events that occur when a request message is received by an embodiment of an intelligent data assimilation system according to the present invention
  • FIG. 13 illustrates an intelligent data assimilation system employing logical search objects according to the present invention
  • FIG. 14 is an XML document embodiment of an exemplary service request message according to the present invention
  • FIG. 15 is an XML document embodiment of an exemplary status message according to the present invention
  • FIG. 16 is an XML document embodiment of an exemplary cancel message according to the present invention
  • FIG. 17 is an XML document embodiment of an exemplary output message according to the present invention.
  • FIG. 18 is an XML document embodiment of an exemplary response message according to the present invention.
  • FIG. 1 is a deployment diagram of an intelligent data assimilation system according to the present invention.
  • An intelligent data assimilation system host 10 is operably coupled to a communications network 14, such as the Internet, via a communications link 12 adapted for communications using a variety of transport protocols such as Transmission Control Protocol/Internet Protocol (TCP/IP).
  • the logical search object host communicates with a data provider host 22 operably coupled to the communications network by a communications link 20 adapted for communications using TCP/IP .
  • the data provider host is operably coupled to a data store such as, but not limited to, a data provider database 24.
  • a data client host 18 is operably coupled to the intelligent data assimilation system host via a communications link 16 adapted for communications using TCP/IP.
  • a data client 26 hosted by the data client host transmits a service request message 27 to an intelligent data assimilation system 101 hosted by the intelligent data assimilation system host.
  • the service request message includes search parameters that the intelligent data assimilation system uses to generate and transmit a query 29 to a data provider server 28 hosted by the data provider host.
  • the data provider server responds to the query by generating and transmitting a data set 31 back to the intelligent data assimilation system.
  • the intelligent data assimilation system populates the ontological instance of the returned data and formats the data set into a formatted data result 33 transmitted to the data client as a response to the service request message.
  • FIG. 2 is a hardware architecture diagram of a general purpose computer suitable for use as an intelligent data assimilation system host.
  • a processor 300 comprising a Central Processing Unit (CPU). 310, a memory cache 320, and a bus interface 330, is operatively coupled via a system bus 335 to a main memory 340 and a I/O control unit 345.
  • the I/O control unit is operatively coupled via a I/O local bus 350 to a disk storage controller 395, and a network controller 380.
  • the disk storage controller is operatively coupled to a disk storage device 325.
  • Computer program instructions 397 for implementing an intelligent data assimilation system are stored on the disk storage device until the microprocessor retrieves the computer program instructions and stores them in the main memory.
  • the microprocessor then executes the computer program instructions stored in the main memory to implement the features of an intelligent data assimilation system.
  • the network controller is operatively coupled to a communications device 396.
  • the communications device is adapted to allow an intelligent data assimilation system hosted by the general purpose computer to communicate via a computer network such as the Internet with other software objects on the computer network.
  • FIG. 3 is an architecture diagram illustrating an embodiment of a an intelligent data assimilation system according to the present invention.
  • An intelligent data assimilation system is coupled to a computer network through the services of an application server (not shown).
  • the intelligent data assimilation system is an object-oriented component-based server platform providing mechanisms and subsystems to create and manipulate a repository 102 of Logical Search Objects (LSOs) that encapsulate knowledge and capabilities used to execute a search for data from a plurality of data providers 104.
  • LSOs Logical Search Objects
  • the intelligent data assimilation system is accessed by a data client via the communications network through the application server.
  • the data providers are logically coupled to the intelligent data assimilation system using a variety of methods.
  • the data providers provide data sets including various data products from data providers such as data aggregators, search robots or agents, HTML-based search engines, in-house databases, or any other mechanism capable of providing a mechanism to extract electronic information.
  • a data provider can be a human actor receiving a request from the intelligent data assimilation system to find and return a data set that is not electronically accessible.
  • a private investigator may be used to collect data for a data set such as confirming the physical location of a person.
  • a LSO 106 is a reusable, configurable and self-contained software component capable of establishing connectivity to a particular data set provided by a data provider.
  • a LS O includes logic for qualifying or filtering a data set and delivering the resulting information as a document written in extensible Markup Language (XML) or as an intelligent data object herein termed a semantic object 108. Additionally, a LSO reveals its capabilities and information to other software components for use by those software components.
  • XML extensible Markup Language
  • semantic object 108 an intelligent data object herein termed a semantic object 108.
  • the complete definition of a LSO is stored in a database 110 as a metadata description.
  • an LSO instance is created from the LSO's metadata definition.
  • the intelligent data assimilation system has the ability to pre-instantiate LSOs if needed.
  • the intelligent data assimilation system provides data mapping tools to add semantic intelligence to atomic data with the purpose of providing users the ability to manipulate LSO results.
  • an intelligent data assimilation system provides abstracted views of data allowing users to manipulate encapsulated data.
  • an intelligent data assimilation system can group a LSO called "First Name” 114 and a LSO called "Last Name” 116 into a semantic object 118 called "Name”, and then group "Name” with another semantic obj ect called "Address” 108 into a semantic object called "Person” 120.
  • This semantic structure allows intelligent traversal and identification of any available information.
  • the LSOs provide a mechanism for the logical search object subsystems to provide tools 122 for the definition of services that may be requested from the intelligent data assimilation system by a data client.
  • the services generate results in the form of data products including data requested by the data client.
  • designers view all the available LSOs and semantic objects in a repository and manipulate LSOs and select desired LSOs or semantic objects and place LSOs and semantic objects into workflows.
  • Each workflow typically represents a single service offering.
  • LSOs into a workflow a designer creates workflows resulting in a service.
  • Some of the features of these workflows include the ability to run multiple LSOs in parallel, define failover LSOs, and define contingencies where the output of one LSO can be the input of another object.
  • An ontology designer 122 is used to create the data mapping of data from a data provider into an intelligent data assimilation system ontology. It is through this process that the intelligent abstractions of data are performed to create semantic obj ects used by the LSOs maintained in the intelligent data assimilation system repository.
  • a product configurator 122 is used to browse and select semantic objects and apply workflow definitions with the purpose of creating intelligent data assimilation system services.
  • the product configurator provides robust productivity features such as drag and drop functionality and wizard style forms.
  • the product configurator surfaces the information encapsulated in semantic objects. This allows the designer to manipulate data as simple atomic data or highly abstracted complex data. Data encapsulated in a LSO is mapped as a single semantic object, thus the designer is able to place that semantic object representing a particular LSO into a workflow.
  • semantic objects are not constrained by LSOs, the product configurator allows the designer to manipulate even higher abstracted semantic objects that straddle multiple LSOs and thus the product configurator derives which LSOs are to be inserted into a workflow for a given service and what constraints, if any, exist between the LSOs encapsulated in a semantic object.
  • the product configurator is able to manipulate abstractions of data and place those items into a workflow.
  • the product configurator also provides the designer the ability to configure other attributes about a data product generated by the service including input display attributes, input rules, result extensible Style Language (XSL) stylesheets, and other attributes that define the data product.
  • XSL result extensible Style Language
  • the intelligent data assimilation system consolidates the results of all LSOs executed in a workflow and represents the resulting data as XML documents that can be delivered to a data client in that form, hi one embodiment, the intelligent data assimilation system reformats the XML documents to create formatted outputs 124 using XSL stylesheets 125. These XSL stylesheets provide for the rendering of formatted data, including the simultaneous delivery of results in HTML 126, XML, Wireless Application Protocol (WAP) 128, Rich Text Format
  • An intelligent data assimilation system ontology facilitates the exchange of data between multiple computer systems independently of the individual system technologies, information architectures and application domain, and allows the creation of groupings of atomic data fields into intelligent building blocks termed semantic objects allowing the construction of services provided by the intelligent data assimilation system.
  • An ontology includes a vocabulary of terms and a specification of what those terms mean.
  • the intelligent data assimilation system ontology provides a set of well-founded constructs that are used to build meaningful higher-level knowledge.
  • Basic terms in the intelligent data assimilation system ontology are selected such that basic foundational concepts and distinctions are defined and specified.
  • the basic terms chosen form a complete set, whose relationship to one another is defined using formal techniques. These formally defined relationships provide the semantic basis for the terminology chosen. It is these relationships included in the intelligent data assimilation system ontology that enables the expression of domain-specific knowledge without including domain-specific terms.
  • An intelligent data assimilation system ontology includes definitions of abstractions. For example, a person semantic is constructed by the underlying atomic or semantic items, first name, middle name, and last name. The definition is used to create a knowledge instance of the ontology, for example when at runtime the ontology is populated with specific semantic and atomic instances returned by the LS Os .
  • a knowledge instance created at run time is either transient or persistent.
  • requests are typically transient because l ⁇ iowledge is persisted into the ontology only for the duration of a particular request instance and then discarded.
  • data is retrieved from a data provider, a portion of the ontology corresponding to LSOs in a workflow is populated using a run-time knowledge instance, and only the knowledge instance created within a particular request instance is processed and once processed the knowledge instance is discarded from the ontology.
  • knowledge instances are persistent such that the knowledge instances can be later queried from the intelligent data assimilation system and assimilated into another service request.
  • a persistent knowledge instance thus has global scope as described above.
  • Commerce agents using the intelligent data assimilation system are adaptable and able to communicate domain-specific knowledge. Commerce agents do this by using the basic terms and relationships defined in the intelligent data assimilation system ontology and through these definitions of the basic terms the intelligent data assimilation system allows the basic terms to be grouped (abstracted) to form higher-level intelligent constructs.
  • an intelligent data assimilation system ontology is stored in a metastore as an XML definition and is represented at many different levels using its XML definition.
  • An intelligent data assimilation system enforces security for all data clients.
  • the fundamental mechanism for performing security is name and password authentication against the Java Naming and Directory Interface (JNDi) tree in the intelligent data assimilation system.
  • JMS Java Messaging Service
  • RMI Remote Method Invocation
  • the intelligent data assimilation system also supports the use of digital signatures. Data sent to the server may include a digital signature that can be verified against a checksum generated from the request and a stored public key. The intelligent data assimilation system also supports data encryption. Once the intelligent data assimilation system has accepted a service request message at a processing tier, the intelligent data assimilation system secures the service request against available services for that customer. Clients do not have access to services not available to them.
  • An intelligent data assimilation system also enforces the return of results to the originating data client. This type of security restricts access to search results and allows access to data by those data clients that submitted a corresponding service request.
  • the intelligent data assimilation system supports a notification mechanism that allows for multiple notifications for any given request. This feature allows the intelligent data assimilation system to respond to a service request by sending results to one or all of a user's email, cell phone, web page etc.
  • the intelligent data assimilation system enforces business rules that relate to particular business processes. These business rules are abstracted into a generalized mechanism through which an intelligent data assimilation system conditionally evaluates and determines an appropriate action.
  • the intelligent data assimilation system supports the ability to perform rules validation at various levels of the intelligent data assimilation system.
  • the intelligent data assimilation system rule architecture is open such that there are well defined exit points that are expected to return boolean values that indicate the success or failure of any rule. These rules are implemented by deployment of any business process, be it a manual intervention, a proprietary algorithm, or an off-the-shelf rules engine.
  • the intelligent data assimilation system architecture is designed to take advantage of current enterprise development standards.
  • components of an intelligent data assimilation system adhere to the Java 2 Enterprise Edition (J2EE) standard for developing multi-tiered enterprise applications.
  • J2EE simplifies enterprise applications by basing them on standardized, modular components, by providing a complete set of services to those components, and by handling many details of application behavior automatically, without complex programming.
  • This embodiment of an intelligent data assimilation system also takes advantage of many features of the J2EE standards such as software portability, the JDBC API for database access, and a security model that protects data even in Internet applications.
  • the underlying platform for these components is a J2EE compatible application server.
  • a suitable application server is BEA Weblogic Application Server. The capabilities of BEA's
  • Weblogic Application Server include a distributed transaction manager, Java Messaging Services (JMS), support for extensible Markup Language (XML), and J2EE standards support.
  • JMS Java Messaging Services
  • XML extensible Markup Language
  • J2EE standards support.
  • an intelligent data assimilation system uses a BEA Weblogic Process Integrator (WLPI) to implement workflow tasks.
  • WLPI provides the intelligent data assimilation system the ability to design and automate business processes that integrate data provider applications, search services, and human intervention.
  • DBMS Oracle DataBase Management System
  • DBMS provides the intelligent data assimilation system with persistence capabilities.
  • FIG. 4 is a diagram of levels within an exemplary portion of one embodiment of an intelligent data assimilation system ontology according to the present invention.
  • the intelligent data assimilation system ontology manages three levels of data abstraction. Initially, raw data fields 140 in a data set are positionally mapped from a data provider into atomic objects such as person atomic object 146 and credit atomic object 148. Atomic objects add type and basic naming information to the data fields. Intelligence about the meaning of the data fields included in the atomic objects is created through the semantic object abstractions in the semantic level 142.
  • the intelligent data assimilation system views data fields as the most basic elements of any data set retrievable by the intelligent data assimilation system. Prior to the ontological definition, these data fields generally include no semantic context or intelligence. Character or byte strings, numbers, decimals and dates can be identified; however, no meaningful definition is applied providing understanding of the data. For example, position 1 to 9 in a data field may include the value "561929975" but a typical data assimilation system does not understand whether this is a social security number, order number, or product number. Associating a semantic context to the data field by an intelligent data assimilation system results in an atomic object.
  • Atomic obj ects have a semantic context associated with a value of a particular type.
  • data fields can be translated into an atomic object.
  • the value might be a string "John 10/9/2000 21:23.234 8A56FB" and include semantic information in the format: First Name, Date, and Result of Query JJD.
  • Semantic objects are groupings of atomic objects, defined in the intelligent data assimilation system ontology, which have a particular business meaning. Intelligence is created by these higher-level groupings. For example, the following atomic objects: first name 147, middle initial 151, and last name 149 can be grouped to provide a "person name" semantic object 150. Additionally, the following atomic objects: addressl 153, city 155, state 157, and zip code
  • Semantic objects are grouped in a data service level 144 of the intelligent data assimilation system ontology to create high-level data abstractions used by other sub-systems within in the intelligent data assimilation system.
  • FIG. 5 is a diagram depicting an exemplary embodiment of an ontological relationship between semantic constructs and their associated LSOs in an ontology according to the present invention.
  • the exemplary embodiment depicts a legal entity class 200 with two children classes, person 218 and corporation 202.
  • Each of these classes includes a child class encapsulating the concept of where an instantiation of the legal entity class may live as defined for legal purposes.
  • the appropriate class is the domicile class 220 and for the corporation the appropriate class is the corporate address class 204.
  • the classes street address 212, city 210, Zip Code 208, and state 206 have multiple parents through multiple inheritance, namely the domicile and corporate address class.
  • the street class further includes two methods, method 1 214 and method 2 216, for retrieving a street address. These methods can be addressed and used by an object sending a request to an instantiation of the domicile or corporate address class in order to customize the data provider for retrieving a street address.
  • a request for data about a person's domicile may include a request for a street address from tax records.
  • Another request for data about the same person's domicile may include a request for a street address pulled from vehicle registration records, hi each case, the data provider is different arid a different method may be used to request and receive data from the different data providers.
  • instantiations of the domicile class or the corporation class answer requests for returning the legal address of aperson or a corporation respectively. For example, if a request is made to find the legal address of a person when the system receives the appropriate request, it invokes a workflow that will instantiate the appropriate LSOs to fetch the requested data. Each of these objects contacts a data provider and requests and receives the appropriate data from the data provider. Associated with this workflow an instance of the ontology will be invoked and when the data has been retrieved, any node, (such as domicile in the example), in the instantiated ontology is able to represent the underlying aggregation of data for that particular node in the ontology. Thus, the domicile object with its ability to aggregate the received data, is capable of representing the person's address.
  • FIG.6 is a diagram depicting an exemplary embodiment of how service classes are created using the ontological relationships of LSOs according to the present invention.
  • the ontological relationships between LSOs are selectively traversed using a data service class.
  • Such a traversal through the ontology results in an answer to a query.
  • exemplary service class, Service 1 222 encapsulates a query "Where does John (aperson) live?".
  • An instantiation of the Service 1 class results in a traversal of the ontological relationships from person 218, to domicile 220, to street 212 using method 1 214, city 210, Zip Code, 208, and state 206. This traversal is shown as a solid line surrounding the appropriate objects.
  • exemplary service class, Service 2224 encapsulates a query "Where can ABC corporation (a legal entity) be served?".
  • An instantiation of the Service 2 class results in a traversal of the ontological relationships from legal entity 200, to corporation 202, to corporate address 220, to street 212 using method 2 216, to city 210, to Zip Code, 208, and to state 206. This traversal is shown as a broken line surrounding the appropriate objects.
  • the exemplary services exploit the ontological relationships between the LSOs by accessing the relationships in different ways.
  • Service 1 encapsulates a query about a person's address
  • Service 2 encapsulates a query about a corporations place of service. In each case, the service must determine a street address and each service ultimately uses the same street, city, Zip Code, and state object classes. However, each service obtained access to these objects from a different parent class.
  • the semantic obj ect construct is not limited to a particular LSO and thus a particular data set.
  • the intelligent data assimilation system is capable of extrapolating LSOs and LSO's appropriate position in multiple workflows from the semantic object. Definitions of the intelligent data assimilation system ontology and the included semantic objects are held in a metadata store of the intelligent data assimilation system and are part of a global internal data dictionary.
  • a metadata store provides persistence for configuration data used by LSOs at execution time.
  • a search configuration includes: an identification of the data provider, a mapping of the results data fields to the ontology of atomic objects known to the intelligent data assimilation system, and a mapping of the request data fields to the ontology of atomic objects known to the data provider.
  • the search configuration includes reference to a LSO to use when performing a search at execution time.
  • FIGS. 7a and 7b are diagrams depicting how semantic objects are described within the metadata store.
  • a semantic object may include other semantic objects and ultimately atomic objects of varying types.
  • a semantic object person 400 includes another semantic object address 402, which includes state 404, city 406, Zip Code 408, and street number 410 atomic objects.
  • the person semantic object also includes atomic objects first name 412 and last name 414.
  • FIG. 7b is a depiction of how the semantic object of FIG. 7a is constructed using a semantic descriptor stored in the metadata store within an embodiment of an intelligent data assimilation system according to the present invention.
  • a generic descriptor, such as "city" data descriptor 420 is a concrete implementation of data descriptor including set methods.
  • a data descriptor describes the name of the data, the type of the data and the data sequence. Since the data is just accessed, a data descriptor interface only exposes get methods for accessing a data provider.
  • a semantic descriptor such as "person” semantic descriptor 416 exposes methods to retrieve child elements that the person semantic descriptor includes. These child elements could be other semantic descriptors, such as "address” semantic descriptor 418 or data descriptors such as "first name” data descriptor 428 and "last name” data descriptor 430.
  • An interface defined in the intelligent data assimilation system has a concrete class that implements the functionality of the interface. For example, a data descriptor is implemented by a generic data descriptor. This class provides the set methods that are not exposed in the data descriptor interface. During initialization, components creating descriptors use a concrete class to create set values. The following is a list of descriptors:
  • SemanticDescriptor exposes get methods elements, hasMoreElements, getNextElement
  • SemanticLisfDescriptor exposes get methods getAtomicLength GenericS emanticListDescrip tor -> concrete implementation of SemanticLisfDescriptor that includes set methods
  • the intelligent data assimilation system model abstracts how the data is represented and focuses more on how pieces of data relate to each other.
  • An intelligent data assimilation system ontology is free to change without inhibiting another data provider's ontology. This provides extensibility because the intelligent data assimilation system and data provider are loosely coupled.
  • An intelligent data assimilation system ontology constructs the semantic context where by search result definitions are defined.
  • An intelligent data assimilation system deploys a workflow engine that manages the execution of service requests that relate to services.
  • the workflow engine is a state machine that manages tasks within a workflow with each workflow having a start node followed by zero or many task nodes followed by an end node.
  • the designer In the process of defining a service the designer identifies all the LSOs that are included in a service. The LSOs are then associated with task nodes in the workflow and thus definition of a given service is achieved.
  • the intelligent data assimilation system maps the service request to a specific workflow, instantiates the workflow, consolidates the resulting data sets returned by the workflow and invokes any analytical or business rule processing for the workflow.
  • the requestor is notified that the data set is available.
  • the data set is stored (herein termed persisted) for later retrieval by the requestor.
  • the resulting data set is delivered to the requestor.
  • the intelligent data assimilation system workflow mechanism provides not only for the sequential execution of task nodes including LSOs but also provides for LSOs to be executed in parallel as well as the ability to create contingent relationships between the LSOs. For example the output of a particular LSO may be the input of another LSO. Additionally, the intelligent data assimilation system workflow mechanism allows for task nodes in the workflow to be associated with other workflows that could include services or other business processes, and thus create a mechanism whereby nesting workflows are used to generate different classes of services and workflows reused.
  • FIG. 8 is a diagram depicting the operations of an embodiment of an exemplary workflow according to the present invention.
  • the workflow processing environment used is a Weblogic Process Integrator (WLP ⁇ )supp ⁇ ied by BEA Systems, Inc. of San Jose California, USA. Workflows are specified using a Weblogic Process Integrator (WLPI) Studio software tool or by creating an XML representation of the workflow.
  • the previously described metadata store includes the data used to construct a particular instance of a workflow used to fulfill a service request. Therefore, the metadata store manages the workflow definition and the LSO definitions that fulfill a service request.
  • a workflow 600 receives a service request from a service request handler 602 in the form of a XML document.
  • the workflow validates the request at step 604.
  • An exemplary validation includes determining if the request includes sufficient information to route and complete the request using internal decision rules. If the request is invalid, results are returned in a results document 608 without performing any search tasks. If the request is validated, then search tasks are initiated at node 610 to fulfill the request.
  • a workflow can initiate multiple tasks at one time. In this exemplary workflow, task A 612 and task B 614 are initiated at one time.
  • Task A uses LSO A 616 to connect to and retrieve data from an external data provider A 618.
  • task B uses LSO B 620 to connect to and retrieve data from an external data provider B 622.
  • the results returned from the two LSOs are merged in a merge process 624.
  • the merging of the results occurs according to an intelligent data assimilation system ontology as previously described.
  • the intelligent data assimilation system allows for additional filtering of a data set once the data has been assimilated.
  • optional "opt-out" rules may be applied to the merged results allowing individuals or entities to exclude personal information from the data set that the individuals or entities do not wish disseminated.
  • An opt-out rule is an internal rule that stops certain information from being released to a service object in response to a data request.
  • the results are returned to the service request handler 608 at the end of the workflow process in the case of an instant search.
  • the data client transmits a search request with the expectation of receiving a search response within a short period of time during which the data client remains coupled to the intelligent data assimilation system.
  • the data client transmits a search request with the expectation of returning to the intelligent data assimilation system to retrieve ' the search results at a latter time.
  • FIG. 9 is a collaboration diagram depicting the interactions within an embodiment of a workflow according to the present invention.
  • the depicted embodiment of a workflow includes several optional processes that may be used alone or collectively within a workflow.
  • An intelligent data assimilation system supports the ability to group together the fulfillment of numerous services in a single request. This provides a mechanism whereby an entire order with associated line items can be sent to the intelligent data assimilation system as a single request.
  • a service request handler component creates one fulfillment request. For each request item present in the message, a service request handler 700 creates a fulfillment line row 702 in a fulfillment line table 765 and invokes a workflow instance 704 passing the fulfillment line Id as a parameter.
  • the state of a running workflow is stored in the fulfillment line.
  • the results of the search are stored in the fulfillment line.
  • intelligent data assimilation system sends a message to the workflow engine to asynchronously generate a workflow instance. All state information is maintained with the fulfillment line so that any following status requests messages can be answered. It is through the fulfillment line that an intelligent data assimilation system recognizes if a workflow instance is still processing, has failed, or is waiting for user input or the results of a manual search.
  • the invocation of the workflow instance includes a message 703 including the fulfillment line ID and the request parameters in the form of an XML document.
  • the workflow instance receives the message and starts 706 a data collection process.
  • the workflow instance sends a notification 710 to a workflow state manager 711 during a start notification node 708.
  • the workflow status manager transmits a status update message 719 to the fulfillment line so that a query of the fulfillment line will indicate that the workflow instance is working on the data request.
  • the workflow instance includes an abort node 712 within the workflow instance that waits for an external abort message 716 to be sent to the workflow instance from an external cancel request handler 714. If an abort message is received, the abort node ends 718 the data collection processes of the workflow instance and an abort message 720 is transmitted to the workflow state manager, hi response, the workflow status manager updates the fulfillment line to indicate that the workflow instance has aborted.
  • the workflow instance checks the input parameters at node 712 to determine if the parameter values are properly formed for making a request of a data provider. If the input parameter values are not properly formed, an exception is thrown 713 to an exception handler. To maintain the appropriate error state in case of failure (runtime exceptions) each workflow instance has an exception handler 715. If anything goes wrong, such as the execution of a LSO raising an exception, the exception handler is executed and the state of the workflow instance is set to "FAILED". In the case of a workflow instance failure, the exception handler sends a status failed message 717 to the workflow state manager so that the status of the workflow instance can be updated in the fulfillment line.
  • a workflow instance can call multiple LSOs either sequentially or in parallel.
  • a first LSO 728 is called at step 722 and an invocation message in the form of an XML document 724 is sent to the LSO.
  • the first LSO transmits data request messages 730 to a data provider 732 and receives requested data messages 734 from the data provider.
  • the LSO responds by sending the requested data messages back to the workflow instance.
  • the workflow instance determines if a complex search is being performed at step 738. If the search is a simple search and only the first LSO is to be called, then the workflow instance ends its search process at step 740 and transmits a status complete message 742 to the workflow state manager. If the workflow instance completed the search, a updated result XML document 764 is sent to the fulfillment line. If the workflow state manager determines that the workflow instance did not complete normally, it throws an exception 760 to the intelligent data assimilation system notifier 762.
  • a complex search such as using several LSOs to query separate databases or an LSO will be collecting data from a manual process
  • the workflow instance invokes a second LSO 744 and sends a second semantic context message 736 in the form of a XML document to a second LSO (not shown).
  • Clients using services from an intelligent data assimilation system may request a fulfillment to be performed within the same session as the request, as in the case of real-time searches.
  • an intelligent data assimilation system has services that are configured to process the request synchronously. For searches that cannot be immediately fulfilled (as in the case of manual searches) an intelligent data assimilation system has services that are configured to process asynchronously.
  • An intelligent data assimilation system therefore, has the ability to synchronize a response message to the completion of workflow and include the resulting output in a response message.
  • an intelligent data assimilation system responds to the data client independently of the completion of the workflow with the idea that the data client will "come back later" to request the results.
  • a workflow instance can call other workflow instances. To do so, a workflow instance calls 746 a sub-workflow instance 750 and transmits an invocation message 748 in the form of a XML document to the sub-workflow instance. The sub-workflow instance operates independently of the workflow instance. However, the workflow instance can wait 746 for the sub-workflow instance to finish processing.
  • the workflow instance may send an exception message in the form of an XML document 754 to the workflow instance if the sub-workflow instance encounters problems during execution.
  • the sub-workflow transmits a workflow end message 752 in the form of a XML document to the workflow instance.
  • the workflow instance receives 756 the end workflow message and merges 758 the data included in the end workflow message into a semantic object for further processing by an intelligent data assimilation system.
  • Definition of a workflow can be done by directly manipulating a service metadata store or through a product configuration tool.
  • workflow definition is achieved by defining the processing flow for LSOs included in a particular service.
  • the intelligent data assimilation system views this as a service request that passes through a set of decisions and LSOs.
  • Various workflow patterns can be developed to support searches synchronously or asynchronously and to merge the resulting output into a well-defined structure. Business rules or proprietary algorithms can be applied to the search results to support the specific nature of workflow context.
  • the workflow engine provides support for pauses in the workflow in order to perform manual intervention. Through this capability, searches that cannot be fulfilled via automation can be performed and the resulting information returned into the workflow and merged with automated searches.
  • An intelligent data assimilation system provides a notification mechanism that is triggered upon completion of a workflow instance.
  • a notification dispatcher utilizes intelligent data assimilation system services that provide e-Mail, facsimile transmission (fax), telephony, pager, DBMS Storage and JMS functionality.
  • Client applications using the intelligent data assimilation system provide notification information as part of a service request message.
  • An intelligent data assimilation system accepts multiple notification methods for any service request.
  • the e-Mail service transmits a formatted e-Mail message to an e-Mail address included in the service request message with an identification of the completed search and information on how the results can be obtained.
  • the e-Mail service can also format the search results and include them in the e-Mail message body.
  • a telephony notification service provides for notification via pagers, fax, and wireless phones.
  • an intelligent data assimilation system updates database tables with values that indicate the completion of the service request.
  • Other systems may request notification by posting a message to a message queue topic. This is accomplished through an intelligent data assimilation system's JMS service. Clients implement a mechanism to listen for messages posted to the queue. Upon receipt of a message, the data client performs request management activities and submits output requests to the intelligent data assimilation system.
  • FIG. 10 depicts an exemplary embodiment of a LSO used within a workflow according to the present invention.
  • a LSO is a self contained software object including a set of components that, when combined together, encapsulate the capability to connect to a data provider, generate a knowledgeinstance by extracting and parsing the data returned by the data provider in response to a request, and deliver the knowledge instance into the ontology for use by the current instance of the request so that the data can be consolidated into a result set.
  • LSOs are reusable components and understand the various access methods to the data provider, protocols on which a search request can be made, and the translation of returned data into a knowledge instance including atomic and semantic structures used by intelligent data assimilation system. These features form a partitioning of several layers of an LSO 500: a translation layer; a data acquisition layer; and the connectivity layer. These layers are realized by software objects termed adaptors.
  • An instantiated LSO includes a translator adaptor 502, an acquisition adaptor 504, and a connection adaptor 506.
  • the translation layer includes software objects that build a data request to a data provider 508 and subsequently parse the returned data stream 510.
  • a request builder software object 512 understands how to structure the request in the manner defined by the data provider.
  • the data provider may request a Structured Query Language (SQL) statement to be sent to a database, a query string to be sent to a Uniform Resource Locator (URL), or a proprietary query protocol.
  • a response parser software object 514 is capable of recognizing a format in which the data is being returned and translate that data into a meaningful semantic context usable by an intelligent data assimilation system and populate a run-time instance of the ontology representing the semantic and atomic objects included within the LSO .
  • the data provider may return an SQL recordset or a character delimited text file in which case the response parser will parse the recordset or character delimited text file and generate the necessary atomic objects as specified for the LSO.
  • the request builder abstracts the construction of request information by building a request at the protocol level. For example, a request builder task creates content that can be immediately fed to a connection adaptor to be executed.
  • Examples request builders are an HTTP request builder for building a request for transmission over an HTTP communications link and a Telnet request builder for transmission of a request over a Telnet communications link.
  • the request builder works on a semantic object to create request queries. This coupled with descriptor information creates a prepared request string.
  • a request builder can utilize different schemes to achieve this, hi an embodiment of a request builder according to the present invention, a request builder utilizes a native object and a native descriptor. The native object holds the implementation details of how a request looks to a particular protocol and data provider.
  • the native object is thus an interface describing an object representing a native view of a search request.
  • an HTTP request builder utilizes a URL native object.
  • the URL native object hides the details of constructing a URL query string that the HTTP protocol uses.
  • the request builder makes use of subordinate classes that perform the work.
  • the request builder then becomes like a container class to perform the action or a controllers.
  • a native descriptor includes ontology information describing a data structure. For example, if a request being sent to a data provider entails a fixed length packet, then the native descriptor includes descriptions of the data structures used to create a fixed length request. Previously described semantic descriptors are used within the native descriptor to describe the ontology information.
  • the response parser abstracts the parsing of a data stream from a data provider and construction of ontological obj ects.
  • the response parser concentrates on the format that the data stream comes back in, not the protocol that retrieved the data stream. For example, if the data stream is character delimited, binary, positional or so on, the parser will deal with those complexities and construct atomic and semantic objects used to populate the portions of arun- time instance of an intelligent data assimilation ontology included within a LSO.
  • a response parser generates ontological objects such as atomic or semantic objects. To achieve this, the parser iterates through a parser descriptor's semantic descriptor discovering new data structures to be accounted for. When a new data structure is encountered, data is extracted from the data stream and an ontological object is created and added to the runtime instance of the intelligent data assimilation ontology.
  • the parser descriptor includes ontology information describing data structures utilizing semantic descriptors. Stream and string tokenizers are also used when it is convenient to transform the data into string arrays or tokens.
  • the data acquisition layer includes a data acquisition adaptor software object used to conduct a search.
  • the data acquisition adaptor manages the translation of parameters into a search request 516, supplies the request to the appropriate connectivity protocol in the connectivity layer, obtains the results 518, manages the translation of the data stream into a semantic XML structure and returns the results into the intelligent data assimilation system.
  • a data acquisition adaptor initializes a request builder, a response parser, and a connection adaptor. Once that happens, the adaptor has the task of interacting with each of these components in order to build a request, send a query to a data provider and parse a reply and populate an ontology with a knowledge instance generated from the reply. For example, an acquisition adaptor at initialization time generates a request builder, a response parser, and a connection adaptor. The data acquisition adaptor "builds" a request by making a method invocation on the request builder. The data acquisition adaptor takes the output from the request builder and feeds the response to the connection adaptor.
  • connection adaptor When the connection adaptor returns, the output, such as an input stream, is fed into the response parser where semantic and atomic objects are made from the input stream. The semantic and atomic objects are then passed back to the LSO which populates the intelligent data assimilation ontology.
  • the connectivity layer includes a connection adaptor for transmitting datapackets 520 and receiving data packets 522 to and from an external data provider 521 using various communication and data provider access protocols.
  • An intelligent data assimilation system is capable of connecting to external data providers using a variety of protocols such as HTTP, various RDBMS protocols such as Oracle Net or SQL Net, and other protocols proprietary to specific data providers.
  • the intelligent data assimilation system understands that a request via HTTP may entail that a socket be established or a request to a database may entail making particular drivers available.
  • the connectivity layer of an LSO provides this capability. The request can then subsequently be passed to the data provider, where it is executed and search results obtained.
  • connection adaptor hides transport specific headers found within a data stream.
  • the connection adaptor also concatenates responses from a data provider. For example, if a search produces three results then the data stream returned should include the three results concatenated together.
  • a connection adaptor establishes a link between the connection adaptor' s environment and an external data provider. For example, a connection adaptor knows the external data provider ' s JP address, port number, and whatever information is entailed in contacting the data provider so that data can be sent.
  • a connection adaptor utilizes a plurality of subordinate classes to accomplish the connection adaptor's tasks. The responses from the external data provider are included in an input stream.
  • a connection to the data provider is setup before runtime so that the connection adaptor can send the request data to the external data provider without re-establishing a connection with the data provider for each request.
  • an LSO is implemented using Enterprise Java Bean (EJB) wrappers providing the LSO with the ability to deploy various combinations of data provider interface components into the LSO at runtime.
  • EJB Enterprise Java Bean
  • FIG. 11 is a diagram depicting the layered architecture of an embodiment of a generic LSO according to the present invention and how this layered architecture is used to generate different LSOs with various connectivity capabilities.
  • the operation between obj ects within the layers has been previously described in FIG. 10.
  • a LSO includes a translator layer
  • the translator layer transmits workflow task data requests down through the acquisition adapter layer, through the connection adapter layer, through the connectivity layer to the data provider.
  • the data provider responds to the data request by accessing a data store 503 to satisfy the data request and transmits data satisfying the request to the LSO.
  • the LSO then reformats the requested data and builds previously described semantic obj ects included in the ontology that are used by an intelligent data assimilation system to respond to data requests from external data clients as previously described.
  • the acquisition layer includes an EJB wrapper sub-layer 526 encapsulating previously described data access objects
  • a database LSO 500 includes database data access objects 505 including Java DataBase Connectivity (JDBC) API to execute SQL statements, thus allowing the database LSO to interact with a SQL-compliant database.
  • JDBC Java DataBase Connectivity
  • the JDBC data access objects use a JDBC connection pool 507 within the language specific protocol sub-layer and a networked database driver 509 in the high-level protocol sub-layer to connect to an external database using a physical communication medium such as a fixed network link 511 or a telephone dial-up link 513 through a TCP/IP low-level protocol sub-layer 547.
  • a physical communication medium such as a fixed network link 511 or a telephone dial-up link 513 through a TCP/IP low-level protocol sub-layer 547.
  • a Web server LSO suitable for connection to an external data provider in the form of a Web server 539
  • a Web server LSO includes HTTP data access objects 515 for connection to a data provider over a communications network adapted for communications using HTTP .
  • the HTTP data access obj ects use a HTTP connection pool 517 within the language specific protocol sub-layer and HTTP driver API 519 in the high-level protocol sub-layer to connect to an external Web server using a physical communication medium such as a fixed network link 511 or a telephone dial-up link 513 through a TCP/IP low-level protocol sub-layer 547.
  • aFTP server LSO suitable for connection to an external data provider in the form of aFTP file server 541
  • aFTP server LSO includes FTP data access objects 521 for connection to a data provider over a communications network adapted for communications using FTP.
  • the FTP data access objects use aFTP connection pool 523 within the language specific protocol sub-layer and FTP driver API 525 in the high-level protocol sub- layer to connect to an external FTP server using a physical communication medium such as a fixed network link 511 or a telephone dial-up link 513 through a TCP/IP low-level protocol sublayer 547.
  • a file server LSO includes file data access objects 521 for connection to a data provider's file system.
  • the file data access objects use file I/O API 523 within the language specific protocol sub-layer and file system API 525 in the high-level protocol sub-layer to connect to a file system 543 stored on a physical medium such as a floppy disk 551 or magnetic tape 553.
  • a manual entry LSO includes que list data access objects 521 for management of the manual entry data.
  • the manual entry access objects use an application 537 within the language specific protocol sub-layer and a user interface in the form of forms and screens 539 in the high-level protocol sub-layer to connect to a human actor 545 through a manual entry input device such as a screen and keyboard 549.
  • An application server provides an interface with an intelligent data assimilation system by accepting service requests from external systems and responding with fulfilled service requests.
  • An application server is also that process that ensures all service requests received by the intelligent data assimilation system are reliably executed.
  • An application server accepts numerous service requests that either request an informational response about the state or information managed by the intelligent data assimilation system or service requests to initiate a service request by instantiating a workflow. While the workflow mechanism supports the ability to create automated workflows including a plurality LSOs without any manual intervention, the workflow mechanism also supports the ability to suspend a workflow and allow for a manual process to occur that will provide data back into the system and then resume the workflow execution.
  • Service requests are submitted to an intelligent data assimilation system through the intelligent data assimilation system's exposed API. These service requests are fulfilled against different data providers. Once the service request is fulfilled, the intelligent data assimilation system will optionally host the result until it is requested by the specific system that submitted the service request. Search results can remain in the intelligent data assimilation system for a configurable period of time.
  • the intelligent data assimilation system API structure is built upon the ability to accept XML messages via a variety of transport mechanisms.
  • the intelligent data assimilation system' s processing tier handles the following transports: • Java Messaging Service (JMS)
  • RMI Remote Message Interface
  • SMSTP Simple Mail Transfer Protocol
  • the processing tier accepts documents sent according to each of the transport protocols.
  • Each of the documents includes a XML document including a service request.
  • the intelligent data assimilation system provides an EJB wrapper to various intelligent data assimilation system proxies that provide an interface to build XML messages and send the XML messages via the requested transport.
  • Configuring a deployment descriptor file sets the transport method utilized by the wrapper component.
  • a data client accessing an intelligent data assimilation system's services can choose to use the wrapper component to facilitate the building and sending of messages or may alternatively choose to communicate directly to the intelligent data assimilation system proxy.
  • Each message sent to the logical search object application server results in obtaining a response from the intelligent data assimilation system coupled to the intelligent data assimilation system.
  • the type of response depends on the kind of submitted message.
  • the messages used include: a service request, a status request, a service information request, and a cancellation request.
  • the messages are transmitted to the intelligent data assimilation system by a data client as an XML document.
  • Each message includes an outer message wrapper including attributes which help identify the type of a message, the source of a message, and the uniqueness of a message from the perspective of a data client.
  • a service request message initiates a request for service in an intelligent data assimilation system.
  • the service request message includes two outer elements 'message' and 'data'.
  • the 'message' element includes attributes to identify the origin and type of message.
  • the 'data' element includes information about what services have been requested.
  • a service request may include one or many requests for service, with one by the 'product' element for each request.
  • the product element includes a number of attributes indicating which services have been requested. Inside the 'product' element there may be multiple tags indicating parameters for the request, formatting options and notification options.
  • the following table includes a description of the elements in a service request message:
  • the exemplary XML document service request message includes an outer message element 1101 and an inner data element 1102 as previously described. Included in the data element is a service-request element 1104 including a service-item element 1106.
  • the service- item element includes two parameter elements, 1108 and 1110, defining a first and last name as part of the search request.
  • the data element further includes a notification element 1112 specifying the type of notification to be used to notify the data client when a data product resulting from the service request is ready.
  • the data element further includes a result-format element for specifying the format of the data product 1114.
  • a results element is included in the data element as a place holder for the resulting data product to be sent to the data client in response to the service request.
  • a status request message is a request for an intelligent data assimilation system to provide status information for a previously sent service request.
  • the following table includes a description of the elements in a status request message:
  • FIG. 15 is an XML document embodiment of an exemplary status message according to the present invention.
  • a message element 1118 includes a data element 1120 as previously described.
  • the message element includes a type attribute 1119 indicating that this is a status message.
  • the data element includes a status element 1122 which includes a plurality of status- request elements 1124, each with its own item and identifier attributes.
  • a cancel request message is a request for an intelligent data assimilation system to stop processing a previous service request. It is possible that before an intelligent data assimilation system has a chance to process the cancel request message, the processes to be cancelled may have already completed.
  • cancel Element which wraps the individual cancel requests.
  • cancel #message-id A unique identifier used to identify this cancel message cancel-request Element which represents a individual cancel request item.
  • cancel-request# ⁇ tem An identifier used to differentiate between cancel requests in the same message idas-id The identifier of the intelligent data assimilation system request a data client is cancelling.
  • FIG. 16 is an XML document embodiment of an exemplary cancel message according to the present invention.
  • the XML document embodiment of an exemplary cancel message includes a previously described message element 1126 including a previously described data element 1128.
  • the type attribute of the message element is "Cancel" 1127 indicating that this is a cancel message.
  • the data element includes a cancel element 1130 which includes a plurality of cancel request elements 1132.
  • An output message is transmitted by a data client to an intelligent data assimilation system to request the results of a previously transmitted service request.
  • the following table includes a description of the elements in an output request message: output Element which wraps the individual output requests.
  • output#message-id A unique identifier used to identify this output message- output-request Element which represents a individual output request item.
  • output-request#item An identifier used to differentiate between output requests in the same message idas-id The identifier of the intelligent data assimilation system process you wish to retrieve data from.
  • format Element indicating a formatting option to be used.
  • format#style The XML stylesheet to used to format the output.
  • FIG. 17 is an XML document embodiment of an exemplary output message according to the present invention.
  • the XML document embodiment of an exemplary output message includes a message element 1134 including a data element 1136 as previously described.
  • the data element includes an output element 1138 including an output-request element.
  • the output- request element includes an identifier attribute 1141 identifying the service request for which a data client requests a result.
  • a format element 1142 specifies the format of the requested data using a style attribute 1144.
  • a response message represents a response given to a request to the intelligent data assimilation system if the request is correctly structured and formatted.
  • the response includes product data, search results data or status data. It also includes indicators whether the request was successful or not.
  • FIG. 18 is an XML document embodiment of an exemplary response message according to the present invention.
  • the XML document embodiment of an exemplary response message includes a previously described message element 1146 including, a previously described data element 1148.
  • the message element includes a type attribute 1149 indicating that the message is a response message.
  • the data element includes a service-response element 1150 including a response element 1152.
  • the response element includes a response- value element 1154 which indicates the date and time that a workflow task was instantiated to satisfy a subject service request.
  • a selection message includes the response to a user selection. It is issued when a user makes a selection from a list of options, and the message encapsulates the result.
  • the following table includes a description of the elements in a selection message:
  • a product information message is used to request information about one or more services provided by the intelligent data assimilation system.
  • a response to a product information message includes information describing the service, available formatting options, an intelligent data assimilation system ontology used for the input parameters and the intelligent data assimilation system ontology for the expected output.
  • data clients access the services of the intelligent data assimilation system via an 0 application server.
  • the application server provides a consistent interface for communications with data clients over a variety of communication networks.
  • the application server receives search request messages from data clients via a plurality of communication networks, translates the search request messages into a consistent internal format for use by the intelligent data assimilation system, and coordinates the transmission of responses to the data clients via the c communication networks .
  • FIG. 12 is a sequence diagram illustrating the sequence of events that occur when a request message is received by an embodiment of an application server according to the present invention.
  • An application server includes a processing tier 1000 and a server tier (not shown).
  • the processing tier includes a protocol specific message extractor 1004, a request processor 0 1010, and a request to event translator 1014.
  • a data client transmits 1002 a protocol specific message 1003 including a previously described request message in the form of a XML document.
  • the data client might be a Web browser and the protocol specific message might be a post transmitted via the Internet using HTTP.
  • the supported protocols include HTTP, SMTP, RMI, and 5 " JMS.
  • the protocol specific message is received by a protocol specific extractor 1004 and the protocol specific extractor extracts 1006 the embedded XML document request message 1008.
  • the XML document request message is transmitted to a request processor 1010.
  • the request processor parses 1009 the XML document request message and extracts attributes in order to 0 determine the class of event handler to be instantiated.
  • the request handler then creates the event handler with the extracted attributes and the XML document request message.
  • the XML document request message 1012 then propagates up to the server tier where it is broken into various components 1016 for the workflow by the request to event translator.
  • FIG. 13 illustrates how an embodiment of an intelligent data assimilation system is 5 coupled to a according to the present invention.
  • An application server includes a server tier including a server controller for controlling the operations of the intelligent data assimilation system.
  • Operations within an intelligent data assimilation system are initiated by a plurality of server events 902 generated within the previously described processing tier 1000 in response to request messages transmitted to the application server by a data client.
  • a state manager 905 coordinates the actions of a plurality of server event handlers 908 responding to the plurality of server events.
  • the server event handlers in turn use the services of the previously described workflow engine 910 to create a instances of previously described workflows.
  • the workflows include a task node 912 that invokes a previously described LS O 914.
  • the properties of the LS O are defined by a set of previously described descriptors stored in the previously described metadata store 916.
  • the LSO includes a previously described translator 918 for interfacing with the workflow task and a previously described acquisition adapter for building queries and parsing responses, and a previously described connection adapter 922 for connection to an external data provider.
  • the LSO makes a connection to an external data provider such as data providers 926, 930, and 934 using an appropriate communications link such as communications links 924, 928, and 932 as previously described.
  • the LSO collects a data set and creates a semantic object encapsulating the data set as previously described.
  • the workflow uses the semantic object to transmit a data result to results data store 938.
  • the intelligent data assimilation system manages the data results in XML format in the results data store.
  • a product definition identifies the duration that the data result will remain in the results data store.
  • Data results that persist in the intelligent data assimilation system can be used for archival purposes.
  • the data result is rendered into an appropriate format for transmission to the data client using XSL stylesheets by the formatter 940 according to a format tag included in the request message.
  • the server controller receives the rendered data result from the formatter and transmits the rendered data result 942 to the data client as a response to the data client's service request message.

Abstract

An intelligent data assimilation system (Figure 3) that includes mechanisms and subsystems to create and manipulate a repopsitory of logical search objects (Figure 3, 102) encapsulating knowledge and capabilities used to execute a search for data from a plurality of data providers (104) by utilizing well defined ontologies (112).

Description

METHOD AND APPARATUS FOR INTELLIGENT DATA ASSIMILATION
BACKGROUND OF THE INVENTION
This invention relates generally to the field of servers coupled to computer networks and more specifically to servers used as search engines.
Web servers are software applications hosted by computer systems operably connected to the Internet. Web servers receive requests from Web clients and provide responses in the form of documents written in a document markup language such as Hyper Text Markup Language (HTML). The first Web servers operated as connections to file systems and served static HTML documents stored as files within a computer system.
The static HTML documents were quickly replaced by programs that accept as input parameters a client request and generate as output a document. These programs are typically written in a variety of interpreted procedural languages such as Perl and acquired procedural languages such as C. The use of document generating programs also introduced new capabilities. Data could be drawn from any source and used to create a document. For example, databases could be queried and the results used within a document. Furthermore, the document generating programs could create side effects during the execution of a document generating program. For example, a typical side effect may include creation and storage of tracking statistics for later use and analysis by a Web site's host.
The use of document generating programs created a bottleneck within a computer system hosting a Web server. Each document generating program ran in its own process and the constant creation and destruction of the required processes created unacceptable overhead requirements. These overhead requirements resulted in the creation of application servers capable of invoking document generation programs without creating new processes.
In one type of such an application server, the application server comprises an interpreter and the document generating programs are written in a proprietary programming language, hi the case where the proprietary programming language is a procedural language, the application server reads and runs the document generating programs without invoking a new process. In the case where the proprietary programming language is an object oriented language such as Java, each document generating program is a software object known as a servlet. In this case, the Web server invokes the servlets as needed and the servlets handle the remaining document generation functions.
Previous development of Web servers focused on producing the most efficient use of computing resources to produce finer and finer vertical slices through a Web site. For example, a servlet once invoked includes all of the necessary logic necessary to acquire data according to a client request, format the acquired data into a document, and report back to the hosting system any statistics about the client/servlet interaction. Finer and finer vertical slices through a Web site leads to an unscalable Web server architecture because all the logic for acquiring data, generating a document, and producing side effects is included within a monolithic software object thus defeating the ability to reuse or integrate a particular software object in another system or software object.
Therefore, a need exists for a scalable Web server architecture that features a high degree of reusability of a Web server's components. The present invention meets such a need.
SUMMARY OF THE INVENTION
In one aspect of the invention, a method is provided for fulfilling a data service request.
The method includes providing an ontology description of a data service a first logical search object operably coupled via a first communications link to a data provider. The first logical search object transmits to the data provider via the communications link a search request generated by the first logical search obj ect from the data service request. The first logical search object receives from the data provider via the communications link a data set in response to the search request. The first logical search object generates a knowledge instance from the data set using the ontology description.
In other aspects of the invention, the method for fulfilling a data service request is adapted for a plurality of data providers including a database server, a FTP server, a Web server, a file system, and a human data provider. Additionally, the method is adapted for a communications protocol proprietary to a data provider.
In another aspect of the invention, the data service request is included in a XML document. In another aspect of the invention, the method for fulfilling a data service request further includes providing a first workflow operably coupled to the logical search obj ect and transmitting by the first logical search object to the first workflow the knowledge instance.
In another aspect of the invention, the method for fulfilling a data service request further includes providing a second logical search object operably coupled to the first workflow, with the first workflow encapsulating the ontological relationship between the first and second logical search objects.
In another aspect of the invention, the method for fulfilling a data service request further includes providing an application server operably coupling a data client to the first workflow via a second communications link. The application server receives from the data client via the second communications link a data service request message with the data service request included in the data service request message. The application server transmits to the first workflow the data service request message and the first workflow transmits to the logical search object the data service request message. hi other aspects of the invention, the second communications link is adapted for communications using SMTP, JMS, HTTP, or RMI. In another aspect of the invention, the method for fulfilling a data service request further includes providing a formatter and formatting by the formatter the data set encapsulated in the knowledge instance into a format requested by the data client. hi another aspect of the invention, the method for fulfilling a data service request further includes providing a second workflow operably coupled to the first workflow. another aspect of the invention, a data processing system is adapted to access a data provider via a communications link. The data processing system includes a processor and a memory operably coupled to the processor and having program instructions stored therein, the processor being operable to execute the program instructions. The program instructions include receiving by a software object a search request message document and generating by the software object a data request for the data provider from the search request message document. The software obj ect transmits to the data provider the data request via the communications link. The software object receives from the data provider a data set via the communications link and generates a semantic object from the data set.
In another aspect of the invention, s data processing system adapted to access a data provider via a communications link includes a parser adaptor operably coupled to a software object. A parser semantic description of the data set and a semantic object semantic description is provided for use by the parser adaptor. The parser adaptor generates extracted data from the data set using the parser semantic description. The parser adaptor then generates a semantic object using the extracted data according to the semantic object semantic description. In another aspect of the invention, a data processing system adapted to access a data provider via a communications link includes a request builder operably coupled to a software object. A native object is operably coupled to the request builder with the native object encapsulating implementation details of a data request for the data provider. A native semantic description is provided including ontology information describing a data structure used by the request builder to build a data request for the data provider. The request builder transmits to the native object a search request and the native object generates a data request from the search request using the native semantic description. BRIEF DESCRIPTION OF THE DRAWINGS
These and other features, aspects, and advantages of the present invention will become better understood with regard to the following description and accompanying drawings where: FIG. 1 is a deployment diagram of an intelligent data assimilation system according to the present invention;
FIG. 2 is a hardware architecture diagram of a general purpose computer suitable for use as an intelligent data assimilation system host;
FIG. 3 is an architecture diagram illustrating an embodiment of an intelligent data assimilation system according to the present invention; FIG.4 is a diagram of the levels within one embodiment of an intelligent data assimilation system ontology according to the present invention;
FIG. 5 is a diagram depicting an exemplary embodiment of an ontological relationship between semantic constructs and their associated logical search objects in an ontology according to the present invention; FIG.6 is a diagram depicting an exemplary embodiment of how service classes are created using the ontological relationships of logical search objects according to the present invention; FIGS. 7a and 7b are diagrams depicting how semantic objects are described within the metadata store;
FIG. 8 is a diagram depicting the operations of an embodiment of an exemplary workflow according to the present invention;
FIG. 9 is a collaboration diagram depicting the interactions within an embodiment of a workflow process according to the present invention;
FIG. 10 depicts an exemplary embodiment of a logical search object used within a workflow according to the present invention; FIG. 11 is a diagram depicting the layered architecture of an embodiment of a generic logical search object according to the present invention and how this layered architecture is used to generate logical search objects with various connectivity capabilities;
FIG. 12 is a sequence diagram illustrating the sequence of events that occur when a request message is received by an embodiment of an intelligent data assimilation system according to the present invention;
FIG. 13 illustrates an intelligent data assimilation system employing logical search objects according to the present invention;
FIG. 14 is an XML document embodiment of an exemplary service request message according to the present invention; FIG. 15 is an XML document embodiment of an exemplary status message according to the present invention; FIG. 16 is an XML document embodiment of an exemplary cancel message according to the present invention;
FIG. 17 is an XML document embodiment of an exemplary output message according to the present invention; and FIG. 18 is an XML document embodiment of an exemplary response message according to the present invention.
DETAILED DESCRIPTION OF THE INVENTION FIG. 1 is a deployment diagram of an intelligent data assimilation system according to the present invention. An intelligent data assimilation system host 10 is operably coupled to a communications network 14, such as the Internet, via a communications link 12 adapted for communications using a variety of transport protocols such as Transmission Control Protocol/Internet Protocol (TCP/IP). The logical search object host communicates with a data provider host 22 operably coupled to the communications network by a communications link 20 adapted for communications using TCP/IP . The data provider host is operably coupled to a data store such as, but not limited to, a data provider database 24. A data client host 18 is operably coupled to the intelligent data assimilation system host via a communications link 16 adapted for communications using TCP/IP. In operation, a data client 26 hosted by the data client host transmits a service request message 27 to an intelligent data assimilation system 101 hosted by the intelligent data assimilation system host. The service request message includes search parameters that the intelligent data assimilation system uses to generate and transmit a query 29 to a data provider server 28 hosted by the data provider host. The data provider server responds to the query by generating and transmitting a data set 31 back to the intelligent data assimilation system. The intelligent data assimilation system populates the ontological instance of the returned data and formats the data set into a formatted data result 33 transmitted to the data client as a response to the service request message.
FIG. 2 is a hardware architecture diagram of a general purpose computer suitable for use as an intelligent data assimilation system host. A processor 300, comprising a Central Processing Unit (CPU). 310, a memory cache 320, and a bus interface 330, is operatively coupled via a system bus 335 to a main memory 340 and a I/O control unit 345. The I/O control unit is operatively coupled via a I/O local bus 350 to a disk storage controller 395, and a network controller 380. The disk storage controller is operatively coupled to a disk storage device 325. Computer program instructions 397 for implementing an intelligent data assimilation system are stored on the disk storage device until the microprocessor retrieves the computer program instructions and stores them in the main memory. The microprocessor then executes the computer program instructions stored in the main memory to implement the features of an intelligent data assimilation system. The network controller is operatively coupled to a communications device 396. The communications device is adapted to allow an intelligent data assimilation system hosted by the general purpose computer to communicate via a computer network such as the Internet with other software objects on the computer network.
FIG. 3 is an architecture diagram illustrating an embodiment of a an intelligent data assimilation system according to the present invention. An intelligent data assimilation system is coupled to a computer network through the services of an application server (not shown). The intelligent data assimilation system is an object-oriented component-based server platform providing mechanisms and subsystems to create and manipulate a repository 102 of Logical Search Objects (LSOs) that encapsulate knowledge and capabilities used to execute a search for data from a plurality of data providers 104. The intelligent data assimilation system is accessed by a data client via the communications network through the application server.
The data providers are logically coupled to the intelligent data assimilation system using a variety of methods. The data providers provide data sets including various data products from data providers such as data aggregators, search robots or agents, HTML-based search engines, in-house databases, or any other mechanism capable of providing a mechanism to extract electronic information. Additionally, a data provider can be a human actor receiving a request from the intelligent data assimilation system to find and return a data set that is not electronically accessible. For example, a private investigator may be used to collect data for a data set such as confirming the physical location of a person. A LSO 106 is a reusable, configurable and self-contained software component capable of establishing connectivity to a particular data set provided by a data provider. Furtheπnore, a LS O includes logic for qualifying or filtering a data set and delivering the resulting information as a document written in extensible Markup Language (XML) or as an intelligent data object herein termed a semantic object 108. Additionally, a LSO reveals its capabilities and information to other software components for use by those software components.
The complete definition of a LSO is stored in a database 110 as a metadata description. At runtime, when a specific instance of a particular LSO is called for, an LSO instance is created from the LSO's metadata definition. However, the intelligent data assimilation system has the ability to pre-instantiate LSOs if needed. The intelligent data assimilation system provides data mapping tools to add semantic intelligence to atomic data with the purpose of providing users the ability to manipulate LSO results. By abstracting (grouping) various fields together into a well-defined ontology 112, an intelligent data assimilation system provides abstracted views of data allowing users to manipulate encapsulated data. For example, an intelligent data assimilation system can group a LSO called "First Name" 114 and a LSO called "Last Name" 116 into a semantic object 118 called "Name", and then group "Name" with another semantic obj ect called "Address" 108 into a semantic object called "Person" 120. This semantic structure allows intelligent traversal and identification of any available information.
By revealing all their intelligence on connectivity, such as ontological data and cost, the LSOs provide a mechanism for the logical search object subsystems to provide tools 122 for the definition of services that may be requested from the intelligent data assimilation system by a data client. The services generate results in the form of data products including data requested by the data client. Through the use of a graphical user interface, designers view all the available LSOs and semantic objects in a repository and manipulate LSOs and select desired LSOs or semantic objects and place LSOs and semantic objects into workflows. Each workflow typically represents a single service offering. By placing one or many
LSOs into a workflow, a designer creates workflows resulting in a service. Some of the features of these workflows include the ability to run multiple LSOs in parallel, define failover LSOs, and define contingencies where the output of one LSO can be the input of another object.
An ontology designer 122, is used to create the data mapping of data from a data provider into an intelligent data assimilation system ontology. It is through this process that the intelligent abstractions of data are performed to create semantic obj ects used by the LSOs maintained in the intelligent data assimilation system repository.
A product configurator 122 is used to browse and select semantic objects and apply workflow definitions with the purpose of creating intelligent data assimilation system services. The product configurator provides robust productivity features such as drag and drop functionality and wizard style forms.
The product configurator surfaces the information encapsulated in semantic objects. This allows the designer to manipulate data as simple atomic data or highly abstracted complex data. Data encapsulated in a LSO is mapped as a single semantic object, thus the designer is able to place that semantic object representing a particular LSO into a workflow. However, semantic objects are not constrained by LSOs, the product configurator allows the designer to manipulate even higher abstracted semantic objects that straddle multiple LSOs and thus the product configurator derives which LSOs are to be inserted into a workflow for a given service and what constraints, if any, exist between the LSOs encapsulated in a semantic object. With LSOs encapsulating information about connectivity, cost, query mechanism, and ontology, the product configurator is able to manipulate abstractions of data and place those items into a workflow. hi addition to the ability to choose and place semantic obj ects into a workflow the product configurator also provides the designer the ability to configure other attributes about a data product generated by the service including input display attributes, input rules, result extensible Style Language (XSL) stylesheets, and other attributes that define the data product.
When the service definition is complete, the designer has the capability to iteratively test and enhance the service in real-time, using either live or simulated data, and see the results in real-time and when satisfied that the service is complete, save the service to the intelligent data assimilation system metadata store and thus have a real time deployment of the service. The intelligent data assimilation system consolidates the results of all LSOs executed in a workflow and represents the resulting data as XML documents that can be delivered to a data client in that form, hi one embodiment, the intelligent data assimilation system reformats the XML documents to create formatted outputs 124 using XSL stylesheets 125. These XSL stylesheets provide for the rendering of formatted data, including the simultaneous delivery of results in HTML 126, XML, Wireless Application Protocol (WAP) 128, Rich Text Format
(RTF), Word (DOC), WordPerfect (WPS), or other formats 130.
An intelligent data assimilation system ontology facilitates the exchange of data between multiple computer systems independently of the individual system technologies, information architectures and application domain, and allows the creation of groupings of atomic data fields into intelligent building blocks termed semantic objects allowing the construction of services provided by the intelligent data assimilation system.
An ontology includes a vocabulary of terms and a specification of what those terms mean. The intelligent data assimilation system ontology provides a set of well-founded constructs that are used to build meaningful higher-level knowledge. Basic terms in the intelligent data assimilation system ontology are selected such that basic foundational concepts and distinctions are defined and specified. The basic terms chosen form a complete set, whose relationship to one another is defined using formal techniques. These formally defined relationships provide the semantic basis for the terminology chosen. It is these relationships included in the intelligent data assimilation system ontology that enables the expression of domain-specific knowledge without including domain-specific terms.
An intelligent data assimilation system ontology includes definitions of abstractions. For example, a person semantic is constructed by the underlying atomic or semantic items, first name, middle name, and last name. The definition is used to create a knowledge instance of the ontology, for example when at runtime the ontology is populated with specific semantic and atomic instances returned by the LS Os .
The life span of a definition and a knowledge instance vary. Definitions included in an intelligent data assimilation system ontology are persistent, reused extensively, and have global scope within an intelligent data assimilation system. A knowledge instance created at run time is either transient or persistent. Within the context of the intelligent data assimilation system, requests are typically transient because lαiowledge is persisted into the ontology only for the duration of a particular request instance and then discarded. For example, data is retrieved from a data provider, a portion of the ontology corresponding to LSOs in a workflow is populated using a run-time knowledge instance, and only the knowledge instance created within a particular request instance is processed and once processed the knowledge instance is discarded from the ontology. In one embodiment of an intelligent data assimilation system according to the present invention, knowledge instances are persistent such that the knowledge instances can be later queried from the intelligent data assimilation system and assimilated into another service request. A persistent knowledge instance thus has global scope as described above.
Commerce agents using the intelligent data assimilation system are adaptable and able to communicate domain-specific knowledge. Commerce agents do this by using the basic terms and relationships defined in the intelligent data assimilation system ontology and through these definitions of the basic terms the intelligent data assimilation system allows the basic terms to be grouped (abstracted) to form higher-level intelligent constructs.
In one embodiment of an intelligent data assimilation system according to the present invention, an intelligent data assimilation system ontology is stored in a metastore as an XML definition and is represented at many different levels using its XML definition.
An intelligent data assimilation system enforces security for all data clients. The fundamental mechanism for performing security is name and password authentication against the Java Naming and Directory Interface (JNDi) tree in the intelligent data assimilation system. Messages sent to intelligent data assimilation system across a protocol method, such as Java Messaging Service (JMS), Remote Method Invocation (RMI), or Hyper Text Transfer Protocol
(HTTP), have name and password authentication level security applied. The intelligent data assimilation system also supports the use of digital signatures. Data sent to the server may include a digital signature that can be verified against a checksum generated from the request and a stored public key. The intelligent data assimilation system also supports data encryption. Once the intelligent data assimilation system has accepted a service request message at a processing tier, the intelligent data assimilation system secures the service request against available services for that customer. Clients do not have access to services not available to them.
An intelligent data assimilation system also enforces the return of results to the originating data client. This type of security restricts access to search results and allows access to data by those data clients that submitted a corresponding service request.
The intelligent data assimilation system supports a notification mechanism that allows for multiple notifications for any given request. This feature allows the intelligent data assimilation system to respond to a service request by sending results to one or all of a user's email, cell phone, web page etc.
The intelligent data assimilation system enforces business rules that relate to particular business processes. These business rules are abstracted into a generalized mechanism through which an intelligent data assimilation system conditionally evaluates and determines an appropriate action. The intelligent data assimilation system supports the ability to perform rules validation at various levels of the intelligent data assimilation system.
The intelligent data assimilation system rule architecture is open such that there are well defined exit points that are expected to return boolean values that indicate the success or failure of any rule. These rules are implemented by deployment of any business process, be it a manual intervention, a proprietary algorithm, or an off-the-shelf rules engine.
The intelligent data assimilation system architecture is designed to take advantage of current enterprise development standards. In one embodiment of an intelligent data assimilation system according to the present invention, components of an intelligent data assimilation system adhere to the Java 2 Enterprise Edition (J2EE) standard for developing multi-tiered enterprise applications.. J2EE simplifies enterprise applications by basing them on standardized, modular components, by providing a complete set of services to those components, and by handling many details of application behavior automatically, without complex programming. This embodiment of an intelligent data assimilation system also takes advantage of many features of the J2EE standards such as software portability, the JDBC API for database access, and a security model that protects data even in Internet applications. hi one embodiment of an intelligent data assimilation system according to the present invention, the underlying platform for these components is a J2EE compatible application server. A suitable application server is BEA Weblogic Application Server. The capabilities of BEA's
Weblogic Application Server include a distributed transaction manager, Java Messaging Services (JMS), support for extensible Markup Language (XML), and J2EE standards support.
In one embodiment of an intelligent data assimilation system according to the present invention, an intelligent data assimilation system uses a BEA Weblogic Process Integrator (WLPI) to implement workflow tasks. WLPI provides the intelligent data assimilation system the ability to design and automate business processes that integrate data provider applications, search services, and human intervention. hi an embodiment of an intelligent data assimilation system according to the present invention, an Oracle DataBase Management System (DBMS) provides the intelligent data assimilation system with persistence capabilities.
FIG. 4 is a diagram of levels within an exemplary portion of one embodiment of an intelligent data assimilation system ontology according to the present invention. The intelligent data assimilation system ontology manages three levels of data abstraction. Initially, raw data fields 140 in a data set are positionally mapped from a data provider into atomic objects such as person atomic object 146 and credit atomic object 148. Atomic objects add type and basic naming information to the data fields. Intelligence about the meaning of the data fields included in the atomic objects is created through the semantic object abstractions in the semantic level 142.
The intelligent data assimilation system views data fields as the most basic elements of any data set retrievable by the intelligent data assimilation system. Prior to the ontological definition, these data fields generally include no semantic context or intelligence. Character or byte strings, numbers, decimals and dates can be identified; however, no meaningful definition is applied providing understanding of the data. For example, position 1 to 9 in a data field may include the value "561929975" but a typical data assimilation system does not understand whether this is a social security number, order number, or product number. Associating a semantic context to the data field by an intelligent data assimilation system results in an atomic object.
Atomic obj ects have a semantic context associated with a value of a particular type. Thus, data fields can be translated into an atomic object. For example, the value might be a string "John 10/9/2000 21:23.234 8A56FB" and include semantic information in the format: First Name, Date, and Result of Query JJD.
Semantic objects are groupings of atomic objects, defined in the intelligent data assimilation system ontology, which have a particular business meaning. Intelligence is created by these higher-level groupings. For example, the following atomic objects: first name 147, middle initial 151, and last name 149 can be grouped to provide a "person name" semantic object 150. Additionally, the following atomic objects: addressl 153, city 155, state 157, and zip code
159 are grouped to create a "person address" semantic object 152. Subsequently, a higher-level semantic object can be created by combining a "person name" with a "person address" to build a "person" semantic object 156. In a similar fashion, a credit file atomic object 148 is used to create a credit score semantic obj ect 154. The credit score semantic obj ect is then used to create a higher-level credit risk semantic object 158. Semantic objects are grouped in a data service level 144 of the intelligent data assimilation system ontology to create high-level data abstractions used by other sub-systems within in the intelligent data assimilation system.
FIG. 5 is a diagram depicting an exemplary embodiment of an ontological relationship between semantic constructs and their associated LSOs in an ontology according to the present invention. The exemplary embodiment depicts a legal entity class 200 with two children classes, person 218 and corporation 202. Each of these classes includes a child class encapsulating the concept of where an instantiation of the legal entity class may live as defined for legal purposes.
In the case of the person class, the appropriate class is the domicile class 220 and for the corporation the appropriate class is the corporate address class 204. The classes street address 212, city 210, Zip Code 208, and state 206 have multiple parents through multiple inheritance, namely the domicile and corporate address class.
The street class further includes two methods, method 1 214 and method 2 216, for retrieving a street address. These methods can be addressed and used by an object sending a request to an instantiation of the domicile or corporate address class in order to customize the data provider for retrieving a street address. For example, a request for data about a person's domicile may include a request for a street address from tax records. Another request for data about the same person's domicile may include a request for a street address pulled from vehicle registration records, hi each case, the data provider is different arid a different method may be used to request and receive data from the different data providers.
In operation, instantiations of the domicile class or the corporation class answer requests for returning the legal address of aperson or a corporation respectively. For example, if a request is made to find the legal address of a person when the system receives the appropriate request, it invokes a workflow that will instantiate the appropriate LSOs to fetch the requested data. Each of these objects contacts a data provider and requests and receives the appropriate data from the data provider. Associated with this workflow an instance of the ontology will be invoked and when the data has been retrieved, any node, (such as domicile in the example), in the instantiated ontology is able to represent the underlying aggregation of data for that particular node in the ontology. Thus, the domicile object with its ability to aggregate the received data, is capable of representing the person's address.
FIG.6 is a diagram depicting an exemplary embodiment of how service classes are created using the ontological relationships of LSOs according to the present invention. The ontological relationships between LSOs are selectively traversed using a data service class. Such a traversal through the ontology results in an answer to a query. For example, exemplary service class, Service 1 222, encapsulates a query "Where does John (aperson) live?". An instantiation of the Service 1 class results in a traversal of the ontological relationships from person 218, to domicile 220, to street 212 using method 1 214, city 210, Zip Code, 208, and state 206. This traversal is shown as a solid line surrounding the appropriate objects.
In another example, exemplary service class, Service 2224, encapsulates a query "Where can ABC corporation (a legal entity) be served?". An instantiation of the Service 2 class results in a traversal of the ontological relationships from legal entity 200, to corporation 202, to corporate address 220, to street 212 using method 2 216, to city 210, to Zip Code, 208, and to state 206. This traversal is shown as a broken line surrounding the appropriate objects. The exemplary services exploit the ontological relationships between the LSOs by accessing the relationships in different ways. Service 1 encapsulates a query about a person's address and Service 2 encapsulates a query about a corporations place of service. In each case, the service must determine a street address and each service ultimately uses the same street, city, Zip Code, and state object classes. However, each service obtained access to these objects from a different parent class.
The semantic obj ect construct is not limited to a particular LSO and thus a particular data set. The intelligent data assimilation system is capable of extrapolating LSOs and LSO's appropriate position in multiple workflows from the semantic object. Definitions of the intelligent data assimilation system ontology and the included semantic objects are held in a metadata store of the intelligent data assimilation system and are part of a global internal data dictionary.
A metadata store provides persistence for configuration data used by LSOs at execution time. A search configuration includes: an identification of the data provider, a mapping of the results data fields to the ontology of atomic objects known to the intelligent data assimilation system, and a mapping of the request data fields to the ontology of atomic objects known to the data provider. Finally, the search configuration includes reference to a LSO to use when performing a search at execution time.
FIGS. 7a and 7b are diagrams depicting how semantic objects are described within the metadata store. A semantic object may include other semantic objects and ultimately atomic objects of varying types. For example, a semantic object person 400 includes another semantic object address 402, which includes state 404, city 406, Zip Code 408, and street number 410 atomic objects. The person semantic object also includes atomic objects first name 412 and last name 414. FIG. 7b is a depiction of how the semantic object of FIG. 7a is constructed using a semantic descriptor stored in the metadata store within an embodiment of an intelligent data assimilation system according to the present invention. A generic descriptor, such as "city" data descriptor 420 is a concrete implementation of data descriptor including set methods. A data descriptor describes the name of the data, the type of the data and the data sequence. Since the data is just accessed, a data descriptor interface only exposes get methods for accessing a data provider. A semantic descriptor, such as "person" semantic descriptor 416 exposes methods to retrieve child elements that the person semantic descriptor includes. These child elements could be other semantic descriptors, such as "address" semantic descriptor 418 or data descriptors such as "first name" data descriptor 428 and "last name" data descriptor 430. An interface defined in the intelligent data assimilation system has a concrete class that implements the functionality of the interface. For example, a data descriptor is implemented by a generic data descriptor. This class provides the set methods that are not exposed in the data descriptor interface. During initialization, components creating descriptors use a concrete class to create set values. The following is a list of descriptors:
DataDescriptor -> exposes get methods for name, type, and sequence GenericDescriptor -> concrete implementation of DataDescriptor that includes set methods
PositionalDescriptor -> exposes get methods for position, and length GenericPositionalDescriptor -> concrete implementation of PositionalDescriptor that includes set methods
SemanticDescriptor -> exposes get methods elements, hasMoreElements, getNextElement
GenericS emanticDescriptor -> concrete implementation of S emanticDescriptor that includes set methods
SemanticLisfDescriptor -> exposes get methods getAtomicLength GenericS emanticListDescrip tor -> concrete implementation of SemanticLisfDescriptor that includes set methods
The intelligent data assimilation system model abstracts how the data is represented and focuses more on how pieces of data relate to each other. An intelligent data assimilation system ontology is free to change without inhibiting another data provider's ontology. This provides extensibility because the intelligent data assimilation system and data provider are loosely coupled. An intelligent data assimilation system ontology constructs the semantic context where by search result definitions are defined.
An intelligent data assimilation system deploys a workflow engine that manages the execution of service requests that relate to services. The workflow engine is a state machine that manages tasks within a workflow with each workflow having a start node followed by zero or many task nodes followed by an end node.
In the process of defining a service the designer identifies all the LSOs that are included in a service. The LSOs are then associated with task nodes in the workflow and thus definition of a given service is achieved. When the intelligent data assimilation system receives a request for the fulfillment of a particular service, the intelligent data assimilation system maps the service request to a specific workflow, instantiates the workflow, consolidates the resulting data sets returned by the workflow and invokes any analytical or business rule processing for the workflow. In one embodiment of an intelligent data assimilation system according to the present invention, the requestor is notified that the data set is available. In another embodiment, the data set is stored (herein termed persisted) for later retrieval by the requestor. In another embodiment, the resulting data set is delivered to the requestor.
The intelligent data assimilation system workflow mechanism provides not only for the sequential execution of task nodes including LSOs but also provides for LSOs to be executed in parallel as well as the ability to create contingent relationships between the LSOs. For example the output of a particular LSO may be the input of another LSO. Additionally, the intelligent data assimilation system workflow mechanism allows for task nodes in the workflow to be associated with other workflows that could include services or other business processes, and thus create a mechanism whereby nesting workflows are used to generate different classes of services and workflows reused. FIG. 8 is a diagram depicting the operations of an embodiment of an exemplary workflow according to the present invention. In one embodiment of an intelligent data assimilation system, the workflow processing environment used is a Weblogic Process Integrator (WLPι)suppιied by BEA Systems, Inc. of San Jose California, USA. Workflows are specified using a Weblogic Process Integrator (WLPI) Studio software tool or by creating an XML representation of the workflow. The previously described metadata store includes the data used to construct a particular instance of a workflow used to fulfill a service request. Therefore, the metadata store manages the workflow definition and the LSO definitions that fulfill a service request.
A workflow 600 receives a service request from a service request handler 602 in the form of a XML document. The workflow validates the request at step 604. An exemplary validation includes determining if the request includes sufficient information to route and complete the request using internal decision rules. If the request is invalid, results are returned in a results document 608 without performing any search tasks. If the request is validated, then search tasks are initiated at node 610 to fulfill the request. A workflow can initiate multiple tasks at one time. In this exemplary workflow, task A 612 and task B 614 are initiated at one time. Task A uses LSO A 616 to connect to and retrieve data from an external data provider A 618. At the same time, task B uses LSO B 620 to connect to and retrieve data from an external data provider B 622. The results returned from the two LSOs are merged in a merge process 624. The merging of the results occurs according to an intelligent data assimilation system ontology as previously described. The intelligent data assimilation system allows for additional filtering of a data set once the data has been assimilated. For example, optional "opt-out" rules may be applied to the merged results allowing individuals or entities to exclude personal information from the data set that the individuals or entities do not wish disseminated. An opt-out rule is an internal rule that stops certain information from being released to a service object in response to a data request. In one embodiment of a workflow according to the present invention, the results are returned to the service request handler 608 at the end of the workflow process in the case of an instant search. In an instant search, the data client transmits a search request with the expectation of receiving a search response within a short period of time during which the data client remains coupled to the intelligent data assimilation system. In another embodiment of a search request according to the present invention, the data client transmits a search request with the expectation of returning to the intelligent data assimilation system to retrieve'the search results at a latter time.
FIG. 9 is a collaboration diagram depicting the interactions within an embodiment of a workflow according to the present invention. The depicted embodiment of a workflow includes several optional processes that may be used alone or collectively within a workflow. An intelligent data assimilation system supports the ability to group together the fulfillment of numerous services in a single request. This provides a mechanism whereby an entire order with associated line items can be sent to the intelligent data assimilation system as a single request. When the intelligent data assimilation system receives a search service request message, a service request handler component creates one fulfillment request. For each request item present in the message, a service request handler 700 creates a fulfillment line row 702 in a fulfillment line table 765 and invokes a workflow instance 704 passing the fulfillment line Id as a parameter.
The state of a running workflow is stored in the fulfillment line. When the workflow instance completes, the results of the search are stored in the fulfillment line. For non real-time requests intelligent data assimilation system sends a message to the workflow engine to asynchronously generate a workflow instance. All state information is maintained with the fulfillment line so that any following status requests messages can be answered. It is through the fulfillment line that an intelligent data assimilation system recognizes if a workflow instance is still processing, has failed, or is waiting for user input or the results of a manual search. The invocation of the workflow instance includes a message 703 including the fulfillment line ID and the request parameters in the form of an XML document. The workflow instance receives the message and starts 706 a data collection process.
The workflow instance sends a notification 710 to a workflow state manager 711 during a start notification node 708. The workflow status manager transmits a status update message 719 to the fulfillment line so that a query of the fulfillment line will indicate that the workflow instance is working on the data request. The workflow instance includes an abort node 712 within the workflow instance that waits for an external abort message 716 to be sent to the workflow instance from an external cancel request handler 714. If an abort message is received, the abort node ends 718 the data collection processes of the workflow instance and an abort message 720 is transmitted to the workflow state manager, hi response, the workflow status manager updates the fulfillment line to indicate that the workflow instance has aborted.
The workflow instance checks the input parameters at node 712 to determine if the parameter values are properly formed for making a request of a data provider. If the input parameter values are not properly formed, an exception is thrown 713 to an exception handler. To maintain the appropriate error state in case of failure (runtime exceptions) each workflow instance has an exception handler 715. If anything goes wrong, such as the execution of a LSO raising an exception, the exception handler is executed and the state of the workflow instance is set to "FAILED". In the case of a workflow instance failure, the exception handler sends a status failed message 717 to the workflow state manager so that the status of the workflow instance can be updated in the fulfillment line.
A workflow instance can call multiple LSOs either sequentially or in parallel. In this example, a first LSO 728 is called at step 722 and an invocation message in the form of an XML document 724 is sent to the LSO. The first LSO transmits data request messages 730 to a data provider 732 and receives requested data messages 734 from the data provider. The LSO responds by sending the requested data messages back to the workflow instance.
The workflow instance determines if a complex search is being performed at step 738. If the search is a simple search and only the first LSO is to be called, then the workflow instance ends its search process at step 740 and transmits a status complete message 742 to the workflow state manager. If the workflow instance completed the search, a updated result XML document 764 is sent to the fulfillment line. If the workflow state manager determines that the workflow instance did not complete normally, it throws an exception 760 to the intelligent data assimilation system notifier 762.
If a complex search is requested, such as using several LSOs to query separate databases or an LSO will be collecting data from a manual process, the workflow instance invokes a second LSO 744 and sends a second semantic context message 736 in the form of a XML document to a second LSO (not shown). Clients using services from an intelligent data assimilation system may request a fulfillment to be performed within the same session as the request, as in the case of real-time searches. To facilitate this, an intelligent data assimilation system has services that are configured to process the request synchronously. For searches that cannot be immediately fulfilled (as in the case of manual searches) an intelligent data assimilation system has services that are configured to process asynchronously. An intelligent data assimilation system, therefore, has the ability to synchronize a response message to the completion of workflow and include the resulting output in a response message. When an asynchronous service is requested, an intelligent data assimilation system responds to the data client independently of the completion of the workflow with the idea that the data client will "come back later" to request the results. A workflow instance can call other workflow instances. To do so, a workflow instance calls 746 a sub-workflow instance 750 and transmits an invocation message 748 in the form of a XML document to the sub-workflow instance. The sub-workflow instance operates independently of the workflow instance. However, the workflow instance can wait 746 for the sub-workflow instance to finish processing. The workflow instance may send an exception message in the form of an XML document 754 to the workflow instance if the sub-workflow instance encounters problems during execution. When the sub-workflow instance completes processing, the sub-workflow transmits a workflow end message 752 in the form of a XML document to the workflow instance. The workflow instance receives 756 the end workflow message and merges 758 the data included in the end workflow message into a semantic object for further processing by an intelligent data assimilation system.
Definition of a workflow can be done by directly manipulating a service metadata store or through a product configuration tool. Thus, workflow definition is achieved by defining the processing flow for LSOs included in a particular service. The intelligent data assimilation system views this as a service request that passes through a set of decisions and LSOs. Various workflow patterns can be developed to support searches synchronously or asynchronously and to merge the resulting output into a well-defined structure. Business rules or proprietary algorithms can be applied to the search results to support the specific nature of workflow context.
The workflow engine provides support for pauses in the workflow in order to perform manual intervention. Through this capability, searches that cannot be fulfilled via automation can be performed and the resulting information returned into the workflow and merged with automated searches.
An intelligent data assimilation system provides a notification mechanism that is triggered upon completion of a workflow instance. A notification dispatcher utilizes intelligent data assimilation system services that provide e-Mail, facsimile transmission (fax), telephony, pager, DBMS Storage and JMS functionality. Client applications using the intelligent data assimilation system provide notification information as part of a service request message. An intelligent data assimilation system accepts multiple notification methods for any service request.
The e-Mail service transmits a formatted e-Mail message to an e-Mail address included in the service request message with an identification of the completed search and information on how the results can be obtained. The e-Mail service can also format the search results and include them in the e-Mail message body.
A telephony notification service provides for notification via pagers, fax, and wireless phones.
For systems that provide access to a DBMS, an intelligent data assimilation system updates database tables with values that indicate the completion of the service request.
Other systems may request notification by posting a message to a message queue topic. This is accomplished through an intelligent data assimilation system's JMS service. Clients implement a mechanism to listen for messages posted to the queue. Upon receipt of a message, the data client performs request management activities and submits output requests to the intelligent data assimilation system.
FIG. 10 depicts an exemplary embodiment of a LSO used within a workflow according to the present invention. A LSO is a self contained software object including a set of components that, when combined together, encapsulate the capability to connect to a data provider, generate a knowledgeinstance by extracting and parsing the data returned by the data provider in response to a request, and deliver the knowledge instance into the ontology for use by the current instance of the request so that the data can be consolidated into a result set.
LSOs are reusable components and understand the various access methods to the data provider, protocols on which a search request can be made, and the translation of returned data into a knowledge instance including atomic and semantic structures used by intelligent data assimilation system. These features form a partitioning of several layers of an LSO 500: a translation layer; a data acquisition layer; and the connectivity layer. These layers are realized by software objects termed adaptors. An instantiated LSO includes a translator adaptor 502, an acquisition adaptor 504, and a connection adaptor 506.
The translation layer includes software objects that build a data request to a data provider 508 and subsequently parse the returned data stream 510. A request builder software object 512 understands how to structure the request in the manner defined by the data provider. For example, the data provider may request a Structured Query Language (SQL) statement to be sent to a database, a query string to be sent to a Uniform Resource Locator (URL), or a proprietary query protocol. A response parser software object 514 is capable of recognizing a format in which the data is being returned and translate that data into a meaningful semantic context usable by an intelligent data assimilation system and populate a run-time instance of the ontology representing the semantic and atomic objects included within the LSO . For instance, the data provider may return an SQL recordset or a character delimited text file in which case the response parser will parse the recordset or character delimited text file and generate the necessary atomic objects as specified for the LSO.
The request builder abstracts the construction of request information by building a request at the protocol level. For example, a request builder task creates content that can be immediately fed to a connection adaptor to be executed. Examples request builders are an HTTP request builder for building a request for transmission over an HTTP communications link and a Telnet request builder for transmission of a request over a Telnet communications link. The request builder works on a semantic object to create request queries. This coupled with descriptor information creates a prepared request string. A request builder can utilize different schemes to achieve this, hi an embodiment of a request builder according to the present invention, a request builder utilizes a native object and a native descriptor. The native object holds the implementation details of how a request looks to a particular protocol and data provider. The native object is thus an interface describing an object representing a native view of a search request. For example, an HTTP request builder utilizes a URL native object. The URL native object hides the details of constructing a URL query string that the HTTP protocol uses. The request builder makes use of subordinate classes that perform the work. The request builder then becomes like a container class to perform the action or a controllers. A native descriptor includes ontology information describing a data structure. For example, if a request being sent to a data provider entails a fixed length packet, then the native descriptor includes descriptions of the data structures used to create a fixed length request. Previously described semantic descriptors are used within the native descriptor to describe the ontology information. The response parser abstracts the parsing of a data stream from a data provider and construction of ontological obj ects. The response parser concentrates on the format that the data stream comes back in, not the protocol that retrieved the data stream. For example, if the data stream is character delimited, binary, positional or so on, the parser will deal with those complexities and construct atomic and semantic objects used to populate the portions of arun- time instance of an intelligent data assimilation ontology included within a LSO.
A response parser generates ontological objects such as atomic or semantic objects. To achieve this, the parser iterates through a parser descriptor's semantic descriptor discovering new data structures to be accounted for. When a new data structure is encountered, data is extracted from the data stream and an ontological object is created and added to the runtime instance of the intelligent data assimilation ontology. The parser descriptor includes ontology information describing data structures utilizing semantic descriptors. Stream and string tokenizers are also used when it is convenient to transform the data into string arrays or tokens.
The data acquisition layer includes a data acquisition adaptor software object used to conduct a search. The data acquisition adaptor manages the translation of parameters into a search request 516, supplies the request to the appropriate connectivity protocol in the connectivity layer, obtains the results 518, manages the translation of the data stream into a semantic XML structure and returns the results into the intelligent data assimilation system.
In operation, a data acquisition adaptor initializes a request builder, a response parser, and a connection adaptor. Once that happens, the adaptor has the task of interacting with each of these components in order to build a request, send a query to a data provider and parse a reply and populate an ontology with a knowledge instance generated from the reply. For example, an acquisition adaptor at initialization time generates a request builder, a response parser, and a connection adaptor. The data acquisition adaptor "builds" a request by making a method invocation on the request builder. The data acquisition adaptor takes the output from the request builder and feeds the response to the connection adaptor. When the connection adaptor returns, the output, such as an input stream, is fed into the response parser where semantic and atomic objects are made from the input stream. The semantic and atomic objects are then passed back to the LSO which populates the intelligent data assimilation ontology.
The connectivity layer includes a connection adaptor for transmitting datapackets 520 and receiving data packets 522 to and from an external data provider 521 using various communication and data provider access protocols. An intelligent data assimilation system is capable of connecting to external data providers using a variety of protocols such as HTTP, various RDBMS protocols such as Oracle Net or SQL Net, and other protocols proprietary to specific data providers. The intelligent data assimilation system understands that a request via HTTP may entail that a socket be established or a request to a database may entail making particular drivers available. The connectivity layer of an LSO provides this capability. The request can then subsequently be passed to the data provider, where it is executed and search results obtained.
The connection adaptor hides transport specific headers found within a data stream. The connection adaptor also concatenates responses from a data provider. For example, if a search produces three results then the data stream returned should include the three results concatenated together.
A connection adaptor establishes a link between the connection adaptor' s environment and an external data provider. For example, a connection adaptor knows the external data provider ' s JP address, port number, and whatever information is entailed in contacting the data provider so that data can be sent. A connection adaptor utilizes a plurality of subordinate classes to accomplish the connection adaptor's tasks. The responses from the external data provider are included in an input stream. In one embodiment of a connection adaptor according to the present invention, a connection to the data provider is setup before runtime so that the connection adaptor can send the request data to the external data provider without re-establishing a connection with the data provider for each request.
There are multiple data provider interface components available at each layer of the LSO, with each data provider interface component implementing specific variations of data provider interfaces, hi an embodiment of an LSO according to the present invention, an LSO is implemented using Enterprise Java Bean (EJB) wrappers providing the LSO with the ability to deploy various combinations of data provider interface components into the LSO at runtime. Because a LSO has the LSO's configuration parameters determined at runtime, semantic obj ects defined in the intelligent data assimilation system ontology are not constrained by a single LS O. Thus semantic obj ects can straddle multiple LS Os, and have the ability that, when invoked, semantic objects are intelligent enough to connect, extract, and assimilate data from numerous disparate data providers. The ability of semantic objects to straddle multiple LSOs allows users of an intelligent data assimilation system to manipulate a single construct that represents either atomic data or an abstracted assimilation of data from a plurality of data providers.
FIG. 11 is a diagram depicting the layered architecture of an embodiment of a generic LSO according to the present invention and how this layered architecture is used to generate different LSOs with various connectivity capabilities. The operation between obj ects within the layers has been previously described in FIG. 10. As previously described, a LSO includes a translator layer
502, an acquisition adapter layer 504, and a connection adapter layer 506. As previously described, the translator layer transmits workflow task data requests down through the acquisition adapter layer, through the connection adapter layer, through the connectivity layer to the data provider. The data provider responds to the data request by accessing a data store 503 to satisfy the data request and transmits data satisfying the request to the LSO. The LSO then reformats the requested data and builds previously described semantic obj ects included in the ontology that are used by an intelligent data assimilation system to respond to data requests from external data clients as previously described.
In one embodiment of an LSO according to the present invention, the acquisition layer includes an EJB wrapper sub-layer 526 encapsulating previously described data access objects
528 such as a previously described request builder and response parser. The connection adapter layer includes a language specific protocol sub-layer 530 operably coupled to a high-level protocol sub-layer 532. The high-level protocol sub-layer sits atop a low-level protocol sublayer 534. In an embodiment of a database LSO suitable for connection to an external data provider in the form of a database 501, a database LSO 500 includes database data access objects 505 including Java DataBase Connectivity (JDBC) API to execute SQL statements, thus allowing the database LSO to interact with a SQL-compliant database. The JDBC data access objects use a JDBC connection pool 507 within the language specific protocol sub-layer and a networked database driver 509 in the high-level protocol sub-layer to connect to an external database using a physical communication medium such as a fixed network link 511 or a telephone dial-up link 513 through a TCP/IP low-level protocol sub-layer 547.
In an embodiment of a Web server LSO suitable for connection to an external data provider in the form of a Web server 539, a Web server LSO includes HTTP data access objects 515 for connection to a data provider over a communications network adapted for communications using HTTP . The HTTP data access obj ects use a HTTP connection pool 517 within the language specific protocol sub-layer and HTTP driver API 519 in the high-level protocol sub-layer to connect to an external Web server using a physical communication medium such as a fixed network link 511 or a telephone dial-up link 513 through a TCP/IP low-level protocol sub-layer 547. In an embodiment of a FTP server LSO suitable for connection to an external data provider in the form of aFTP file server 541, aFTP server LSO includes FTP data access objects 521 for connection to a data provider over a communications network adapted for communications using FTP. The FTP data access objects use aFTP connection pool 523 within the language specific protocol sub-layer and FTP driver API 525 in the high-level protocol sub- layer to connect to an external FTP server using a physical communication medium such as a fixed network link 511 or a telephone dial-up link 513 through a TCP/IP low-level protocol sublayer 547.
In an embodiment of a file LSO suitable for connection to an external data provider in the fonn of a file system 543, a file server LSO includes file data access objects 521 for connection to a data provider's file system. The file data access objects use file I/O API 523 within the language specific protocol sub-layer and file system API 525 in the high-level protocol sub-layer to connect to a file system 543 stored on a physical medium such as a floppy disk 551 or magnetic tape 553.
In an embodiment of a manual entry LSO a suitable for connection to an external data provider in the form of a human actor 545, a manual entry LSO includes que list data access objects 521 for management of the manual entry data. The manual entry access objects use an application 537 within the language specific protocol sub-layer and a user interface in the form of forms and screens 539 in the high-level protocol sub-layer to connect to a human actor 545 through a manual entry input device such as a screen and keyboard 549. An application server provides an interface with an intelligent data assimilation system by accepting service requests from external systems and responding with fulfilled service requests. An application server is also that process that ensures all service requests received by the intelligent data assimilation system are reliably executed. An application server accepts numerous service requests that either request an informational response about the state or information managed by the intelligent data assimilation system or service requests to initiate a service request by instantiating a workflow. While the workflow mechanism supports the ability to create automated workflows including a plurality LSOs without any manual intervention, the workflow mechanism also supports the ability to suspend a workflow and allow for a manual process to occur that will provide data back into the system and then resume the workflow execution. Service requests are submitted to an intelligent data assimilation system through the intelligent data assimilation system's exposed API. These service requests are fulfilled against different data providers. Once the service request is fulfilled, the intelligent data assimilation system will optionally host the result until it is requested by the specific system that submitted the service request. Search results can remain in the intelligent data assimilation system for a configurable period of time.
The intelligent data assimilation system API structure is built upon the ability to accept XML messages via a variety of transport mechanisms. In one embodiment of an intelligent data assimilation system according to the present invention, the intelligent data assimilation system' s processing tier handles the following transports: • Java Messaging Service (JMS)
"• Hyper Text Transfer Protocol (HTTP)
• Remote Message Interface (RMI)
Simple Mail Transfer Protocol (SMTP)
The processing tier accepts documents sent according to each of the transport protocols.
Each of the documents includes a XML document including a service request. The intelligent data assimilation system provides an EJB wrapper to various intelligent data assimilation system proxies that provide an interface to build XML messages and send the XML messages via the requested transport. Configuring a deployment descriptor file sets the transport method utilized by the wrapper component.
A data client accessing an intelligent data assimilation system's services can choose to use the wrapper component to facilitate the building and sending of messages or may alternatively choose to communicate directly to the intelligent data assimilation system proxy.
Each message sent to the logical search object application server results in obtaining a response from the intelligent data assimilation system coupled to the intelligent data assimilation system. The type of response depends on the kind of submitted message. In one embodiment of an intelligent data assimilation system according to the present invention, the messages used include: a service request, a status request, a service information request, and a cancellation request. In this embodiment of an intelligent data assimilation system, the messages are transmitted to the intelligent data assimilation system by a data client as an XML document. Each message includes an outer message wrapper including attributes which help identify the type of a message, the source of a message, and the uniqueness of a message from the perspective of a data client.
The basic message elements are described in the following table:
Figure imgf000027_0001
A service request message initiates a request for service in an intelligent data assimilation system. The service request message includes two outer elements 'message' and 'data'. The 'message' element includes attributes to identify the origin and type of message.
The 'data' element includes information about what services have been requested. A service request may include one or many requests for service, with one by the 'product' element for each request. The product element includes a number of attributes indicating which services have been requested. Inside the 'product' element there may be multiple tags indicating parameters for the request, formatting options and notification options.
The following table includes a description of the elements in a service request message:
Figure imgf000027_0002
of the message. Includes format element described earlier. results Element which the workflow uses to populate the result data.
Referring now to FIG. 14, an exemplary XML document service request message 1100 is shown. The exemplary XML document service request message includes an outer message element 1101 and an inner data element 1102 as previously described. Included in the data element is a service-request element 1104 including a service-item element 1106. The service- item element includes two parameter elements, 1108 and 1110, defining a first and last name as part of the search request. The data element further includes a notification element 1112 specifying the type of notification to be used to notify the data client when a data product resulting from the service request is ready. The data element further includes a result-format element for specifying the format of the data product 1114. A results element is included in the data element as a place holder for the resulting data product to be sent to the data client in response to the service request.
A status request message is a request for an intelligent data assimilation system to provide status information for a previously sent service request.
The following table includes a description of the elements in a status request message:
Figure imgf000028_0001
FIG. 15 is an XML document embodiment of an exemplary status message according to the present invention. A message element 1118 includes a data element 1120 as previously described. The message element includes a type attribute 1119 indicating that this is a status message. The data element includes a status element 1122 which includes a plurality of status- request elements 1124, each with its own item and identifier attributes.
A cancel request message is a request for an intelligent data assimilation system to stop processing a previous service request. It is possible that before an intelligent data assimilation system has a chance to process the cancel request message, the processes to be cancelled may have already completed.
The following table includes a description of the elements in a cancel request message: cancel Element which wraps the individual cancel requests. cancel #message-id A unique identifier used to identify this cancel message cancel-request Element which represents a individual cancel request item. cancel-request#ιtem An identifier used to differentiate between cancel requests in the same message idas-id The identifier of the intelligent data assimilation system request a data client is cancelling.
FIG. 16 is an XML document embodiment of an exemplary cancel message according to the present invention. The XML document embodiment of an exemplary cancel message includes a previously described message element 1126 including a previously described data element 1128. The type attribute of the message element is "Cancel" 1127 indicating that this is a cancel message. The data element includes a cancel element 1130 which includes a plurality of cancel request elements 1132.
An output message is transmitted by a data client to an intelligent data assimilation system to request the results of a previously transmitted service request. The following table includes a description of the elements in an output request message: output Element which wraps the individual output requests. output#message-id A unique identifier used to identify this output message- output-request Element which represents a individual output request item. output-request#item An identifier used to differentiate between output requests in the same message idas-id The identifier of the intelligent data assimilation system process you wish to retrieve data from. format Element indicating a formatting option to be used. format#style The XML stylesheet to used to format the output.
FIG. 17 is an XML document embodiment of an exemplary output message according to the present invention. The XML document embodiment of an exemplary output message includes a message element 1134 including a data element 1136 as previously described. The data element includes an output element 1138 including an output-request element. The output- request element includes an identifier attribute 1141 identifying the service request for which a data client requests a result. A format element 1142 specifies the format of the requested data using a style attribute 1144.
A response message represents a response given to a request to the intelligent data assimilation system if the request is correctly structured and formatted. The response includes product data, search results data or status data. It also includes indicators whether the request was successful or not.
The following table includes a description of the elements in a response message:
Figure imgf000030_0001
FIG. 18 is an XML document embodiment of an exemplary response message according to the present invention. The XML document embodiment of an exemplary response message includes a previously described message element 1146 including, a previously described data element 1148. The message element includes a type attribute 1149 indicating that the message is a response message. The data element includes a service-response element 1150 including a response element 1152. The response element includes a response- value element 1154 which indicates the date and time that a workflow task was instantiated to satisfy a subject service request.
A selection message includes the response to a user selection. It is issued when a user makes a selection from a list of options, and the message encapsulates the result.
The following table includes a description of the elements in a selection message:
Figure imgf000030_0002
3elected#item | attribute which give the item which was selected.
A product information message is used to request information about one or more services provided by the intelligent data assimilation system. A response to a product information message includes information describing the service, available formatting options, an intelligent data assimilation system ontology used for the input parameters and the intelligent data assimilation system ontology for the expected output.
In an embodiment of an intelligent data assimilation system according to the present invention, data clients access the services of the intelligent data assimilation system via an 0 application server. The application server provides a consistent interface for communications with data clients over a variety of communication networks. The application server receives search request messages from data clients via a plurality of communication networks, translates the search request messages into a consistent internal format for use by the intelligent data assimilation system, and coordinates the transmission of responses to the data clients via the c communication networks .
FIG. 12 is a sequence diagram illustrating the sequence of events that occur when a request message is received by an embodiment of an application server according to the present invention. An application server includes a processing tier 1000 and a server tier (not shown). The processing tier includes a protocol specific message extractor 1004, a request processor 0 1010, and a request to event translator 1014. A data client transmits 1002 a protocol specific message 1003 including a previously described request message in the form of a XML document. For example, the data client might be a Web browser and the protocol specific message might be a post transmitted via the Internet using HTTP. In one embodiment of a processing tier according to the present invention, the supported protocols include HTTP, SMTP, RMI, and 5 " JMS.
The protocol specific message is received by a protocol specific extractor 1004 and the protocol specific extractor extracts 1006 the embedded XML document request message 1008.
The XML document request message is transmitted to a request processor 1010. The request processor parses 1009 the XML document request message and extracts attributes in order to 0 determine the class of event handler to be instantiated. The request handler then creates the event handler with the extracted attributes and the XML document request message. The XML document request message 1012 then propagates up to the server tier where it is broken into various components 1016 for the workflow by the request to event translator.
FIG. 13 illustrates how an embodiment of an intelligent data assimilation system is 5 coupled to a according to the present invention. An application server includes a server tier including a server controller for controlling the operations of the intelligent data assimilation system. Operations within an intelligent data assimilation system are initiated by a plurality of server events 902 generated within the previously described processing tier 1000 in response to request messages transmitted to the application server by a data client. A state manager 905 coordinates the actions of a plurality of server event handlers 908 responding to the plurality of server events. The server event handlers in turn use the services of the previously described workflow engine 910 to create a instances of previously described workflows. The workflows include a task node 912 that invokes a previously described LS O 914. The properties of the LS O are defined by a set of previously described descriptors stored in the previously described metadata store 916. The LSO includes a previously described translator 918 for interfacing with the workflow task and a previously described acquisition adapter for building queries and parsing responses, and a previously described connection adapter 922 for connection to an external data provider.
The LSO makes a connection to an external data provider such as data providers 926, 930, and 934 using an appropriate communications link such as communications links 924, 928, and 932 as previously described. The LSO collects a data set and creates a semantic object encapsulating the data set as previously described. The workflow uses the semantic object to transmit a data result to results data store 938.
The intelligent data assimilation system manages the data results in XML format in the results data store. A product definition identifies the duration that the data result will remain in the results data store. Data results that persist in the intelligent data assimilation system can be used for archival purposes. The data result is rendered into an appropriate format for transmission to the data client using XSL stylesheets by the formatter 940 according to a format tag included in the request message. The server controller receives the rendered data result from the formatter and transmits the rendered data result 942 to the data client as a response to the data client's service request message.
Although this invention has been described in certain specific embodiments, many additional modifications and variations would be apparent to those skilled in the art. It is therefore to be understood that this invention may be practiced otherwise than as specifically described. Thus, the present embodiments of the invention should be considered in all respects as illustrative and not restrictive, the scope of the invention to be determined by claims supported by this application and the claim's equivalents rather than the foregoing description.

Claims

WHAT IS CLAIMED IS :
1. A method for fulfilling a data service request, the method comprising: providing an ontology description of a data service; operably coupling a first logical search object to a data provider via a first communications link; generating a search request by the first logical search obj ect from the data service request; transmitting the search request by the first logical search object to the data provider via the communications link, receiving by the first logical search object from the data provider via the communications link a data set in response to the search request; and generating by the first logical search object a knowledge instance from the data set using the ontology description.
2. The method of claim 1, further comprising the first communications link commuiricating with a database server.
3. The method of claim 1 the first communications link communicating with a FTP server.
4. The method of claim 1, further comprising the first communications link communicating with a Web server.
5. The method of claim 1, further comprising the first communications link communicating with a file system.
6. The method of claim 1, further comprising the first communications link communicating with a human data provider.
7. The method of claim 1, further comprising the first communications link communicating with a communications protocol proprietary to the data provider.
8. The method of claim 1 , further comprising the data service request is included in a XML document.
9. The method of claim 1, further comprising: operably coupling a first workflow to the logical search object; and transmitting the knowledge instance by the first logical search object to the first workflow.
10. The method of claim 9 further comprising: operably coupling a second logical search object to the first workflow, the first workflow; and encapsulating by the first workflow the ontological relationship between the first and second logical search objects.
11. The method of claim 9, further comprising: operably coupling by an application server a data client to the first workflow via a second communications link, and receiving by the application server from the data client via the second communications link a data service request message, the data service request message including the data service request; transmitting by the application server to the first workflow the data service request message; and transmitting by the first workflow to the logical search object the data service request message.
12. The method of claim 10, further comprising the second communications link communicating using SMTP.
13. The method of claim 10, further comprising the second communications link communicating using JMS.
14. The method of claim 10, further comprising the second communications link communicating using HTTP .
15. The method of claim 10, further comprising the second communications link communicating using RMI.
16. The method of claim 9, wherein the logical search obj ect is specified by the first workflow.
17. The method of claim 11, whereinthefirstworkflow is specified by the application server using the service request message.
18. The method of claim 10 further comprising: providing a formatter; and formatting by the formatter the data set encapsulated in the knowledge instance into a format requested by the data client.
19. The method of claim 9 further comprising operably coupling a second workflow to the first workflow.
20. A method for accessing by a software obj ect a data provider via a communications link, comprising: receiving by the software object from a second software object a search request message document; generating by the software object a data request for the data provider from the search request message document; transmitting by the software object to the data provider the data request via the communications link; receiving by the software object from the data provider a data set via the communications link; and generating by the software object a semantic object from the data set.
21. The method of claim 20 wherein generating by the software object a semantic object from the data set further includes: operably coupling a parser adaptor to the software object; providing a parser semantic description of the data set for use by the parser adaptor; providing a semantic object semantic description; generating by the parser adaptor extracted data from the data set using the parser semantic description; and generating by the parser adaptor the semantic object using the extracted data according to the semantic object semantic description.
22. The method of claim 20 wherein generating by the software object a data request for the data provider from the search request further includes: operably coupling a request builder to the software object; operably coupling a native object to the request builder; encapsulating by the native obj ect implementation details of a data request for the data provider; providing a native semantic description including ontology information describing a data structure used by the request builder to build the data request for the data provider; transmitting by the request builder to the native object the search request; and generating by the native obj ect the data request from the search request using the native semantic description.
23. A data processing system adapted to fulfill a data service request, comprising: a processor; and a memory operably coupled to the processor and having program instructions stored therein, the processor being operable to execute the program instructions, the program instructions including: providing an ontology description of a data service; operably coupling a first logical search obj ect to a data provider via a first communications link; generating a search request by the first logical search obj ect from the data service request; transmitting the search request by the first logical search obj ect to the data provider via the communications link; receiving by the first logical search object from the data provider via the communications link a data set in response to the search request; and generating by the first logical search object a knowledge instance from the data set using the ontology description.
24. The data processing system of claim 23, wherein the first communications link communicates with a database server.
25. The data processing system of claim 23, wherein the first communications link communicates with a FTP server.
26. The data processing system of claim 23, wherein the first communications link communicates with a Web server.
27. The data processing system of claim 23, wherein the first communications link communicates with a file system.
28. The data processing system of claim 23, wherein the first communications link communicates with a human data provider.
29. The data processing system of claim 23, wherein the first communications link communicates with a communications protocol proprietary to the data provider.
30. The data processing system of claim 23, wherein the data service request is included in a XML document.
31. The data processing system of claim 23„ the program instructions further including: operably coupling a first workflow to the logical search object; and transmitting by the first logical search object to the first workflow the knowledge instance.
32. The data processing system of claim 31, the program instructions further including: operably coupling a second logical search object to the first workflow; and encapsulating by the first window the ontological relationship between the first and second logical search objects.
33. The data processing system of claim 31, the program instructions further including: operably coupling by an application server a data client to the first workflow via a second communications link; and receiving by the application server from the data client via the second communications link a data service request message, the data service request message including the data service request; transmitting by the application server to the first workflow the data service request message; and transmitting by the first workflow to the logical search object the data service request message.
34. The data processing system of claim 32, wherein the second communications link communicates using SMTP.
35. The data processing system of claim 32, wherein the second communications link communicates using JMS.
36. The data processing system of claim 32, wherein the second communications link communicates using HTTP.
37. The data processing system of claim 32, wherein the second communications link communicates using RMI.
38. The data processing system of claim 32, wherein the logical search object is specified by the first workflow.
39. The data processing system of claim 33, wherein the first workflow is specified by the application server using the service request message.
40. The data processing system of claim 32, the program instructions further including: providing a formatter; and formatting by the formatter the data set encapsulated in the knowledge instance into a format requested by the data client.
41. The data processing system of claim 31 , the program instructions further including operably coupling a second workflow to the first workflow.
42. A data processing system adapted to access a data provider via a communications link, comprising: a processor; and a memory operably coupled to the processor and having program instructions stored therein, the processor being operable to execute the program instructions, the program instructions including: receiving by a software object a search request message document; generating by the software obj ect a data request for the data provider from the search request message document; transmitting by the software object to the data provider the data request via the communications link; receiving by the software object from the data provider a data set via the communications link; and generating by the software object a semantic object from the data set.
43. The data processing system of claim 42, wherein the program instructions for generating by the software object a semantic object from the data set further include: operably coupling a parser adaptor to the software object; providing a parser semantic description of the data set for use by the parser adaptor; providing a semantic object semantic description; generating by the parser adaptor extracted data from the data set using the parser semantic description; and generating by the parser adaptor the semantic object using the extracted data according to the semantic object semantic description.
44. The data processing system of claim 42, wherein the program instructions for generating by the software object a data request for the data provider from the search request further include: operably coupling a request builder to the software object; operably coupling a native object to the request builder; encapsulating by the native obj ect implementation details of a data request for the data provider; providing a native semantic description including ontology information describing a data structure used by the request builder to build the data request for the data provider; transmitting by the request builder to the native object the search request; and generating by the native object the data request from the search request using the native semantic description.
45. A computer readable media embodying program instructions for execution by a computer, the computer program instructions adapting a computer to fulfill a data service request, the program instructions comprising: providing an ontology description of a data service; operably coupling a first logical search object to a data provider via a first communications link; generating a search request by the first logical search object from the data service request; transmitting the search request by the first logical search object to the data provider via the communications link; receiving by the first logical search object from the data provider via the communications link a data set in response to the search request; and generating by the first logical search object a knowledge instance from the data set using the ontology description.
46. The computer readable media of claim 45, wherein the first communications link communicates with a database server.
47. The computer readable media of claim 45, wherein the first communications link communicates with a FTP server.
48. The computer readable media of claim 45, wherein the first communications link communicates with a Web server.
49. The computer readable media of claim 45 , wherein the first communications link communicates with a file system.
50. The computer readable media of claim 45, wherein the first communications link communicates with a human data provider.
51. The computer readable media of claim 45, wherein the first communications link communicates with a communications protocol proprietary to the data provider.
52. The computer readable media of claim 45, wherein the data service request is included in a XML document.
53. The computer readable media of claim 45, the program instructions further comprising: operably coupling a first workflow to the logical search object; and transmitting by the first logical search object to the first workflow the knowledge instance.
54. The computer readable media of claim 53, the program instructions further comprising: operably coupling a second logical search object to the first workflow; and encapsulating by the first workflow the ontological relationship between the first and second logical search objects.
55. The computer readable media of claim 53, the program instructions further comprising: operably coupling by an application server a data client to the first workflow via a second communications link, and receiving by the application server from the data client via the second communications link a data service request message, the data service request message including the data service request; transmitting by the application server to the first workflow the data service request message; and transmitting by the first workflow to the logical search object the data service request message.
56. The computer readable media of claim 54, wherein the second communications link communicates using SMTP.
57. The computer readable media of claim 54, wherein the second communications link communicates using JMS.
58. The computer readable media of claim 54, wherein the second communications link communicates using HTTP.
59. The computer readable media of claim 54, wherein the second communications link communicates using RMI.
60. The computer readable media of claim 54, wherein the logical search object is specified by the first workflow.
61. The computer readable media of claim 54, wherein the first workflow is specified by the application server using the service request message.
62. The computer readable media of claim 54, the program instructions further comprising: providing a formatter; and formatting by the formatter the data set encapsulated in the knowledge instance into a format requested by the data client.
63. The computer readable media of claim 53, the program instructions further comprising operably coupling a second workflow to the first workflow.
64. A computer readable media embodying program instructions for execution by a computer, the computer program instructions adapting a computer t o access a data provider via a communications link, the program instructions comprising: receiving by a software object a search request message document; generating by the software object a data request for the data provider from the search request message document; transmitting by the software object to the data provider the data request via the communications link; receiving by the software object from the data provider a data set via the communications link; and generating by the software object a semantic object from the data set.
65. The data processing system of claim 64 wherein the program instructions for generating by the software object a semantic object from the data set further comprise: operably coupling a parser adaptor to the software object; providing a parser semantic description of the data set for use by the parser adaptor; providing a semantic object semantic description; generating by the parser adaptor extracted data from the data set using the parser semantic description; and generating by the parser adaptor the semantic object using the extracted data according to the semantic object semantic description.
66. The data processing system of claim 64 wherein the program instructions for generating by the software object a data request for the data provider from the search request further comprise: operably coupling a request builder to the software object; operably coupling a native object to the request builder; encapsulating by the native obj ect implementation details of a data request for the data provider; providing a native semantic description including ontology information describing a data structure used by the request builder to build the data request for the data provider; transmitting by the request builder to the native object the search request; and generating by the native object the data request from the search request using the native semantic description.
PCT/US2002/009528 2001-03-26 2002-03-26 Method and apparatus for intelligent data assimilation WO2002077904A2 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
EP02728596A EP1435047A4 (en) 2001-03-26 2002-03-26 Method and apparatus for intelligent data assimilation
AU2002258640A AU2002258640B2 (en) 2001-03-26 2002-03-26 Method and apparatus for intelligent data assimilation

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US27893001P 2001-03-26 2001-03-26
US60/278,930 2001-03-26
US09/916,243 US6847974B2 (en) 2001-03-26 2001-07-25 Method and apparatus for intelligent data assimilation
US09/916,243 2001-07-25

Publications (2)

Publication Number Publication Date
WO2002077904A2 true WO2002077904A2 (en) 2002-10-03
WO2002077904A3 WO2002077904A3 (en) 2004-05-06

Family

ID=26959353

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2002/009528 WO2002077904A2 (en) 2001-03-26 2002-03-26 Method and apparatus for intelligent data assimilation

Country Status (5)

Country Link
US (1) US6847974B2 (en)
EP (1) EP1435047A4 (en)
CN (1) CN1333362C (en)
AU (1) AU2002258640B2 (en)
WO (1) WO2002077904A2 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7373338B2 (en) 2003-03-18 2008-05-13 British Telecommunications Public Limited Company Access control to shared resources
CN102025652A (en) * 2010-06-04 2011-04-20 西本新干线股份有限公司 Service bus and message processing method

Families Citing this family (253)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8706538B1 (en) * 2002-04-17 2014-04-22 Paul V. Morinville Business process nesting method and apparatus
US20120016683A1 (en) * 2000-02-01 2012-01-19 Paul Morinville Automated Execution of Business Processes Using Reverse Nesting
US7490092B2 (en) 2000-07-06 2009-02-10 Streamsage, Inc. Method and system for indexing and searching timed media information based upon relevance intervals
US20050033583A1 (en) * 2000-07-27 2005-02-10 Bergeron Heather Ellen Processing transactions using a structured natural language
US20050010394A1 (en) * 2000-07-27 2005-01-13 Bergeron Heather Ellen Configuring a semantic network to process transactions
US20050033605A1 (en) * 2000-07-27 2005-02-10 Bergeron Heather Ellen Configuring a semantic network to process health care transactions
US20060173672A1 (en) * 2000-07-27 2006-08-03 Bergeron Heather E Processing health care transactions using a semantic network
US20030036917A1 (en) * 2001-04-25 2003-02-20 Metallect Corporation Service provision system and method
US7146399B2 (en) * 2001-05-25 2006-12-05 2006 Trident Company Run-time architecture for enterprise integration with transformation generation
US8412746B2 (en) 2001-05-25 2013-04-02 International Business Machines Corporation Method and system for federated querying of data sources
US20050038629A1 (en) * 2001-05-25 2005-02-17 Ruth Amaru Pricing of enterprise information resource management systems
US20040093559A1 (en) * 2001-05-25 2004-05-13 Ruth Amaru Web client for viewing and interrogating enterprise data semantically
US7673282B2 (en) * 2001-05-25 2010-03-02 International Business Machines Corporation Enterprise information unification
US20030163450A1 (en) * 2001-05-25 2003-08-28 Joram Borenstein Brokering semantics between web services
US7877421B2 (en) * 2001-05-25 2011-01-25 International Business Machines Corporation Method and system for mapping enterprise data assets to a semantic information model
US20060064666A1 (en) 2001-05-25 2006-03-23 Amaru Ruth M Business rules for configurable metamodels and enterprise impact analysis
US7099885B2 (en) * 2001-05-25 2006-08-29 Unicorn Solutions Method and system for collaborative ontology modeling
US20030101170A1 (en) * 2001-05-25 2003-05-29 Joseph Edelstein Data query and location through a central ontology model
US20030097457A1 (en) * 2001-08-08 2003-05-22 Amitabh Saran Scalable multiprocessor architecture for business computer platforms
FR2829337B1 (en) * 2001-09-03 2003-10-31 Schneider Automation AUTOMATION EQUIPMENT CONNECTED TO A TCP / IP NETWORK
US8042132B2 (en) 2002-03-15 2011-10-18 Tvworks, Llc System and method for construction, delivery and display of iTV content
US8365230B2 (en) 2001-09-19 2013-01-29 Tvworks, Llc Interactive user interface for television applications
JP4161212B2 (en) * 2001-09-21 2008-10-08 野路 福三 Table data generation computer system used for corporate management
US7284196B2 (en) * 2001-10-05 2007-10-16 Vitria Technology, Inc. Vocabulary and syntax based data transformation
GB2381340A (en) * 2001-10-27 2003-04-30 Hewlett Packard Co Document generation in a distributed information network
EP3401794A1 (en) * 2002-01-08 2018-11-14 Seven Networks, LLC Connection architecture for a mobile network
US7369984B2 (en) * 2002-02-01 2008-05-06 John Fairweather Platform-independent real-time interface translation by token mapping without modification of application code
US7703116B1 (en) 2003-07-11 2010-04-20 Tvworks, Llc System and method for construction, delivery and display of iTV applications that blend programming information of on-demand and broadcast service offerings
US7945636B2 (en) * 2002-05-15 2011-05-17 In-Store Broadcasting Network, Llc Providing a multi-tier enterprise level application
US7047488B2 (en) 2002-07-19 2006-05-16 Open Invention Network Registry driven interoperability and exchange of documents
US7729922B2 (en) 2002-08-15 2010-06-01 Open Invention Network, Llc Dynamic interface between BPSS conversation management and local business management
US20050005116A1 (en) * 2002-09-18 2005-01-06 Commerce One Operations, Inc. Dynamic interoperability contract for web services
US8220018B2 (en) 2002-09-19 2012-07-10 Tvworks, Llc System and method for preferred placement programming of iTV content
CA2409079A1 (en) * 2002-10-21 2004-04-21 Ibm Canada Limited-Ibm Canada Limitee Creating multiple and cascading business interpretations from raw application data using transformation layering
KR20050065638A (en) * 2002-10-28 2005-06-29 제이지알 어퀴지션 인코퍼레이티드 Transparent ejb support and horizontal data partitioning
US20040153558A1 (en) * 2002-10-31 2004-08-05 Mesut Gunduc System and method for providing java based high availability clustering framework
US7640267B2 (en) * 2002-11-20 2009-12-29 Radar Networks, Inc. Methods and systems for managing entities in a computing device using semantic objects
US7584208B2 (en) * 2002-11-20 2009-09-01 Radar Networks, Inc. Methods and systems for managing offers and requests in a network
US7917468B2 (en) * 2005-08-01 2011-03-29 Seven Networks, Inc. Linking of personal information management data
US8468126B2 (en) * 2005-08-01 2013-06-18 Seven Networks, Inc. Publishing data in an information community
US7853563B2 (en) * 2005-08-01 2010-12-14 Seven Networks, Inc. Universal data aggregation
US7143081B2 (en) * 2003-02-12 2006-11-28 International Business Machines Corporation Automated abstract database generation through existing application statement analysis
US8578411B1 (en) 2003-03-14 2013-11-05 Tvworks, Llc System and method for controlling iTV application behaviors through the use of application profile filters
US11381875B2 (en) 2003-03-14 2022-07-05 Comcast Cable Communications Management, Llc Causing display of user-selectable content types
EP1460540A1 (en) * 2003-03-20 2004-09-22 Dassault Systèmes S.A. Server process with interface adapters for accessing data from various types of client processes
US7761480B2 (en) * 2003-07-22 2010-07-20 Kinor Technologies Inc. Information access using ontologies
US7395271B2 (en) * 2003-08-25 2008-07-01 Oracle International Corporation Mechanism to enable evolving XML schema
US7814470B2 (en) 2003-08-27 2010-10-12 International Business Machines Corporation Multiple service bindings for a real time data integration service
US20050240592A1 (en) * 2003-08-27 2005-10-27 Ascential Software Corporation Real time data integration for supply chain management
US8307109B2 (en) 2003-08-27 2012-11-06 International Business Machines Corporation Methods and systems for real time integration services
US7814142B2 (en) 2003-08-27 2010-10-12 International Business Machines Corporation User interface service for a services oriented architecture in a data integration platform
US20050222931A1 (en) * 2003-08-27 2005-10-06 Ascential Software Corporation Real time data integration services for financial information data integration
US8041760B2 (en) 2003-08-27 2011-10-18 International Business Machines Corporation Service oriented architecture for a loading function in a data integration platform
US8060553B2 (en) * 2003-08-27 2011-11-15 International Business Machines Corporation Service oriented architecture for a transformation function in a data integration platform
US8819734B2 (en) 2003-09-16 2014-08-26 Tvworks, Llc Contextual navigational control for digital television
US7937433B1 (en) * 2003-09-23 2011-05-03 Embarq Holdings Company, Llc Queuing connector to promote message servicing
US8453196B2 (en) 2003-10-14 2013-05-28 Salesforce.Com, Inc. Policy management in an interoperability network
EP1530139A1 (en) * 2003-11-05 2005-05-11 Sap Ag Method and computer system for workflow management
US7844639B2 (en) * 2003-11-24 2010-11-30 Ebay Inc. Backward compatibility in database schemas
US7818759B2 (en) 2003-11-24 2010-10-19 Ebay Inc. API and business language schema design framework for message exchanges
US8775654B2 (en) 2003-12-19 2014-07-08 Salesforce.Com, Inc. Apparatus and methods for mediating messages
US20050166047A1 (en) * 2004-01-23 2005-07-28 Tollett Wayne J. Intra-encapsulation intelligent searching of an object
US7433876B2 (en) * 2004-02-23 2008-10-07 Radar Networks, Inc. Semantic web portal and platform
US7725299B2 (en) * 2004-03-01 2010-05-25 Purdue Research Foundation Multi-tier and multi-domain distributed rapid product configuration and design system
US7287042B1 (en) * 2004-03-03 2007-10-23 A9.Com, Inc. Search engine system supporting inclusion of unformatted search string after domain name portion of URL
US7761406B2 (en) 2004-03-16 2010-07-20 International Business Machines Corporation Regenerating data integration functions for transfer from a data integration platform
GB2414572B (en) * 2004-05-24 2009-03-18 Orangery Software Ltd Data processing systems and methods
US20050262119A1 (en) * 2004-05-24 2005-11-24 Gary Mawdsley Data processing systems and methods
US20050267918A1 (en) * 2004-05-28 2005-12-01 Gatev Andrei A System and method for bundling deployment descriptor files within an enterprise archive for fast reliable resource setup at deployment time
US7877726B2 (en) * 2004-07-13 2011-01-25 The Mitre Corporation Semantic system for integrating software components
US7823123B2 (en) * 2004-07-13 2010-10-26 The Mitre Corporation Semantic system for integrating software components
US7725605B2 (en) 2004-08-06 2010-05-25 Salesforce.Com, Inc. Providing on-demand access to services in a wide area network
US7593846B2 (en) * 2004-09-02 2009-09-22 Microsoft Corporation Method and apparatus for building semantic structures using self-describing fragments
US7505989B2 (en) * 2004-09-03 2009-03-17 Biowisdom Limited System and method for creating customized ontologies
US7493333B2 (en) 2004-09-03 2009-02-17 Biowisdom Limited System and method for parsing and/or exporting data from one or more multi-relational ontologies
US20060074832A1 (en) * 2004-09-03 2006-04-06 Biowisdom Limited System and method for utilizing an upper ontology in the creation of one or more multi-relational ontologies
US7496593B2 (en) * 2004-09-03 2009-02-24 Biowisdom Limited Creating a multi-relational ontology having a predetermined structure
US9645712B2 (en) 2004-10-01 2017-05-09 Grand Central Communications, Inc. Multiple stakeholders for a single business process
WO2006045102A2 (en) 2004-10-20 2006-04-27 Seven Networks, Inc. Method and apparatus for intercepting events in a communication system
US8010082B2 (en) * 2004-10-20 2011-08-30 Seven Networks, Inc. Flexible billing architecture
US7643818B2 (en) 2004-11-22 2010-01-05 Seven Networks, Inc. E-mail messaging to/from a mobile terminal
US7706781B2 (en) * 2004-11-22 2010-04-27 Seven Networks International Oy Data security in a mobile e-mail service
US8214461B1 (en) * 2004-11-23 2012-07-03 Hewlett-Packard Development Company, L.P. Method of processing request by server computer system
FI117152B (en) 2004-12-03 2006-06-30 Seven Networks Internat Oy E-mail service provisioning method for mobile terminal, involves using domain part and further parameters to generate new parameter set in list of setting parameter sets, if provisioning of e-mail service is successful
US10394543B2 (en) 2005-01-21 2019-08-27 International Business Machines Corporation Lifecycle objectification of non-activity objects in an activity thread
US20060212324A1 (en) * 2005-02-22 2006-09-21 Transparency Software, Inc. Graphical representation of organization actions
US20060200496A1 (en) * 2005-02-22 2006-09-07 Transparency Software, Inc. Organization action incidents
AT8155U3 (en) * 2005-02-24 2006-08-15 Shanker Bhawani Dipl Ing PROCESS AND COMPUTER TECHNICAL IMPLEMENTATION FOR THE FLEXIBLE CLASSIFICATION OF PRODUCTS AND PRODUCT SYSTEMS
US20060195425A1 (en) * 2005-02-28 2006-08-31 Microsoft Corporation Composable query building API and query language
US7877703B1 (en) 2005-03-14 2011-01-25 Seven Networks, Inc. Intelligent rendering of information in a limited display environment
US7796742B1 (en) 2005-04-21 2010-09-14 Seven Networks, Inc. Systems and methods for simplified provisioning
US8438633B1 (en) 2005-04-21 2013-05-07 Seven Networks, Inc. Flexible real-time inbox access
US7818667B2 (en) 2005-05-03 2010-10-19 Tv Works Llc Verification of semantic constraints in multimedia data and in its announcement, signaling and interchange
US20060271582A1 (en) * 2005-05-25 2006-11-30 Caterpillar Inc. System and method for analyzing raw data files
US7720904B2 (en) * 2005-05-27 2010-05-18 Microsoft Corporation Entity projection
WO2006136660A1 (en) 2005-06-21 2006-12-28 Seven Networks International Oy Maintaining an ip connection in a mobile network
US8069166B2 (en) 2005-08-01 2011-11-29 Seven Networks, Inc. Managing user-to-user contact with inferred presence information
EP1758051A1 (en) * 2005-08-22 2007-02-28 Ubs Ag System, method and computer program for workflow based on data processing
US8544098B2 (en) * 2005-09-22 2013-09-24 Alcatel Lucent Security vulnerability information aggregation
US7899817B2 (en) * 2005-10-05 2011-03-01 Microsoft Corporation Safe mode for inverse query evaluations
US8156208B2 (en) * 2005-11-21 2012-04-10 Sap Ag Hierarchical, multi-tiered mapping and monitoring architecture for service-to-device re-mapping for smart items
US7860968B2 (en) * 2005-11-21 2010-12-28 Sap Ag Hierarchical, multi-tiered mapping and monitoring architecture for smart items
US8005879B2 (en) * 2005-11-21 2011-08-23 Sap Ag Service-to-device re-mapping for smart items
US20070118496A1 (en) * 2005-11-21 2007-05-24 Christof Bornhoevd Service-to-device mapping for smart items
US7627661B2 (en) * 2005-12-20 2009-12-01 At&T Intellectual Property, I, L.P. Methods, systems, and computer program products for implementing ontological domain services
US7590649B2 (en) 2005-12-20 2009-09-15 At&T Intellectual Property, I,L.P. Methods, systems, and computer program products for implementing intelligent agent services
US8849691B2 (en) 2005-12-29 2014-09-30 Microsoft Corporation Modeling user input and interaction in workflow based applications
US20070179833A1 (en) * 2006-01-31 2007-08-02 Infosys Technologies Ltd. Assisted business process exception management
WO2007133308A2 (en) * 2006-02-16 2007-11-22 United States Postal Service Centralized processing and management system
US7769395B2 (en) 2006-06-20 2010-08-03 Seven Networks, Inc. Location-based operations and messaging
US8495004B2 (en) 2006-03-27 2013-07-23 International Business Machines Corporation Determining and storing at least one results set in a global ontology database for future use by an entity that subscribes to the global ontology database
US8522341B2 (en) 2006-03-31 2013-08-27 Sap Ag Active intervention in service-to-device mapping for smart items
US7890568B2 (en) * 2006-04-28 2011-02-15 Sap Ag Service-to-device mapping for smart items using a genetic algorithm
US8296408B2 (en) * 2006-05-12 2012-10-23 Sap Ag Distributing relocatable services in middleware for smart items
US7602471B2 (en) * 2006-05-17 2009-10-13 Taiwan Semiconductor Manufacturing Company, Ltd. Apparatus and method for particle monitoring in immersion lithography
US8131838B2 (en) 2006-05-31 2012-03-06 Sap Ag Modular monitor service for smart item monitoring
US8296413B2 (en) * 2006-05-31 2012-10-23 Sap Ag Device registration in a hierarchical monitor service
US8065411B2 (en) * 2006-05-31 2011-11-22 Sap Ag System monitor for networks of nodes
US20070290787A1 (en) * 2006-06-20 2007-12-20 Trevor Fiatal Systems and methods for group messaging
US11093987B2 (en) * 2006-06-30 2021-08-17 Whapps Llc System and method for providing data for on-line product catalogues
US7707161B2 (en) * 2006-07-18 2010-04-27 Vulcan Labs Llc Method and system for creating a concept-object database
EP1883011A1 (en) * 2006-07-21 2008-01-30 Software Ag System and methods for generating runtime messages
US8396788B2 (en) * 2006-07-31 2013-03-12 Sap Ag Cost-based deployment of components in smart item environments
US8924838B2 (en) * 2006-08-09 2014-12-30 Vcvc Iii Llc. Harvesting data from page
US7870163B2 (en) * 2006-09-28 2011-01-11 Oracle International Corporation Implementation of backward compatible XML schema evolution in a relational database system
JP5308621B2 (en) * 2006-10-05 2013-10-09 日立オートモティブシステムズ株式会社 Map data distribution system
US20080097800A1 (en) * 2006-10-23 2008-04-24 Kutlev Krassimir G System and method for marketing and developing a custom substation
CN101174963A (en) * 2006-10-31 2008-05-07 华为技术有限公司 Method and system for implementing on-line user searching
US20080148284A1 (en) * 2006-12-15 2008-06-19 Maui Media Lab Llc Apparatus and method for developing and executing applications with declarative objects
US20090037397A1 (en) * 2007-05-03 2009-02-05 Sourcecode Technology Holding, Inc. Methods and apparatus for providing context search results in process design
US20080294425A1 (en) * 2007-05-21 2008-11-27 Justsystems Evans Research, Inc. Method and apparatus for performing semantic update and replace operations
US20080294426A1 (en) * 2007-05-21 2008-11-27 Justsystems Evans Research, Inc. Method and apparatus for anchoring expressions based on an ontological model of semantic information
US20080295013A1 (en) * 2007-05-21 2008-11-27 Justsystems Evans Research, Inc. Method and apparatus for performing semantically informed text operations
US20080294427A1 (en) * 2007-05-21 2008-11-27 Justsystems Evans Research, Inc. Method and apparatus for performing a semantically informed merge operation
US8805425B2 (en) * 2007-06-01 2014-08-12 Seven Networks, Inc. Integrated messaging
US8693494B2 (en) * 2007-06-01 2014-04-08 Seven Networks, Inc. Polling
US20080306798A1 (en) * 2007-06-05 2008-12-11 Juergen Anke Deployment planning of components in heterogeneous environments
JP5327497B2 (en) * 2007-07-11 2013-10-30 日立オートモティブシステムズ株式会社 Map data distribution system and map data update method
US20090076887A1 (en) * 2007-09-16 2009-03-19 Nova Spivack System And Method Of Collecting Market-Related Data Via A Web-Based Networking Environment
US8584140B2 (en) * 2007-09-21 2013-11-12 Presenceid, Inc. Systems and methods for receiving and sending messages about changes to data attributes
US8527622B2 (en) * 2007-10-12 2013-09-03 Sap Ag Fault tolerance framework for networks of nodes
US20090106307A1 (en) * 2007-10-18 2009-04-23 Nova Spivack System of a knowledge management and networking environment and method for providing advanced functions therefor
WO2009056448A1 (en) * 2007-10-29 2009-05-07 International Business Machines Corporation Method and apparatus for last message notification
US8336024B2 (en) * 2007-11-08 2012-12-18 International Business Machines Corporation Extracting ontological information from software design data
US8364181B2 (en) * 2007-12-10 2013-01-29 Seven Networks, Inc. Electronic-mail filtering for mobile devices
US8793305B2 (en) 2007-12-13 2014-07-29 Seven Networks, Inc. Content delivery to a mobile device from a content service
US9002828B2 (en) 2007-12-13 2015-04-07 Seven Networks, Inc. Predictive content delivery
US9990674B1 (en) 2007-12-14 2018-06-05 Consumerinfo.Com, Inc. Card registry systems and methods
US8107921B2 (en) 2008-01-11 2012-01-31 Seven Networks, Inc. Mobile virtual network operator
US8862657B2 (en) * 2008-01-25 2014-10-14 Seven Networks, Inc. Policy based content service
US20090193338A1 (en) * 2008-01-28 2009-07-30 Trevor Fiatal Reducing network and battery consumption during content delivery and playback
US20090248670A1 (en) * 2008-03-31 2009-10-01 Trevor Fiatal Content search engine
US8787947B2 (en) * 2008-06-18 2014-07-22 Seven Networks, Inc. Application discovery on mobile devices
US8312033B1 (en) 2008-06-26 2012-11-13 Experian Marketing Solutions, Inc. Systems and methods for providing an integrated identifier
US8078158B2 (en) 2008-06-26 2011-12-13 Seven Networks, Inc. Provisioning applications for a mobile device
US20100004975A1 (en) * 2008-07-03 2010-01-07 Scott White System and method for leveraging proximity data in a web-based socially-enabled knowledge networking environment
US8140578B2 (en) * 2008-07-15 2012-03-20 International Business Machines Corporation Multilevel hierarchical associations between entities in a knowledge system
US8001100B2 (en) * 2008-07-15 2011-08-16 International Business Machines Corporation Leveraging multilevel hierarchical associations to find cross-boundary issues
US8332870B2 (en) * 2008-09-30 2012-12-11 Accenture Global Services Limited Adapter services
US8909759B2 (en) * 2008-10-10 2014-12-09 Seven Networks, Inc. Bandwidth measurement
US7945598B2 (en) * 2008-10-28 2011-05-17 International Business Machines Corporation Methodology for the automatic capture of process information in federated knowledge systems
US8060424B2 (en) 2008-11-05 2011-11-15 Consumerinfo.Com, Inc. On-line method and system for monitoring and reporting unused available credit
US11832024B2 (en) * 2008-11-20 2023-11-28 Comcast Cable Communications, Llc Method and apparatus for delivering video and video-related content at sub-asset level
US20100161441A1 (en) * 2008-12-24 2010-06-24 Comcast Interactive Media, Llc Method and apparatus for advertising at the sub-asset level
US9442933B2 (en) * 2008-12-24 2016-09-13 Comcast Interactive Media, Llc Identification of segments within audio, video, and multimedia items
US8713016B2 (en) 2008-12-24 2014-04-29 Comcast Interactive Media, Llc Method and apparatus for organizing segments of media assets and determining relevance of segments to a query
US11531668B2 (en) 2008-12-29 2022-12-20 Comcast Interactive Media, Llc Merging of multiple data sets
US9354847B2 (en) 2008-12-29 2016-05-31 Microsoft Technology Licensing, Llc Interface infrastructure for a continuation based runtime
US8176043B2 (en) 2009-03-12 2012-05-08 Comcast Interactive Media, Llc Ranking search results
WO2010120934A2 (en) * 2009-04-15 2010-10-21 Evri Inc. Search enhanced semantic advertising
US8200617B2 (en) 2009-04-15 2012-06-12 Evri, Inc. Automatic mapping of a location identifier pattern of an object to a semantic type using object metadata
US8862579B2 (en) * 2009-04-15 2014-10-14 Vcvc Iii Llc Search and search optimization using a pattern of a location identifier
US9037567B2 (en) * 2009-04-15 2015-05-19 Vcvc Iii Llc Generating user-customized search results and building a semantics-enhanced search engine
US8533223B2 (en) 2009-05-12 2013-09-10 Comcast Interactive Media, LLC. Disambiguation and tagging of entities
WO2010138972A2 (en) 2009-05-29 2010-12-02 Abacast, Inc. Selective access of multi-rate data from a server and/or peer
US20100324948A1 (en) * 2009-06-18 2010-12-23 Microsoft Corporation Managing event timelines
US9892730B2 (en) 2009-07-01 2018-02-13 Comcast Interactive Media, Llc Generating topic-specific language models
WO2011075825A1 (en) * 2009-12-21 2011-06-30 Kik Interactive, Inc. Systems and methods for accessing and controlling media stored remotely
WO2011126889A2 (en) 2010-03-30 2011-10-13 Seven Networks, Inc. 3d mobile user interface with configurable workspace management
US10078674B2 (en) * 2010-06-04 2018-09-18 Mcl Systems Limited Integrated workflow and database transactions
US8838783B2 (en) 2010-07-26 2014-09-16 Seven Networks, Inc. Distributed caching for resource and mobile network traffic management
US9077630B2 (en) 2010-07-26 2015-07-07 Seven Networks, Inc. Distributed implementation of dynamic wireless traffic policy
GB2500333B (en) 2010-07-26 2014-10-08 Seven Networks Inc Mobile application traffic optimization
US9043433B2 (en) 2010-07-26 2015-05-26 Seven Networks, Inc. Mobile network traffic coordination across multiple applications
US9560130B2 (en) 2010-09-30 2017-01-31 Microsoft Technology Licensing, Llc Presenting availability statuses of synchronized objects
WO2012060995A2 (en) 2010-11-01 2012-05-10 Michael Luna Distributed caching in a wireless network of content delivered for a mobile application over a long-held request
US8843153B2 (en) 2010-11-01 2014-09-23 Seven Networks, Inc. Mobile traffic categorization and policy for network use optimization while preserving user experience
WO2012061437A1 (en) 2010-11-01 2012-05-10 Michael Luna Cache defeat detection and caching of content addressed by identifiers intended to defeat cache
US9060032B2 (en) 2010-11-01 2015-06-16 Seven Networks, Inc. Selective data compression by a distributed traffic management system to reduce mobile data traffic and signaling traffic
US8166164B1 (en) 2010-11-01 2012-04-24 Seven Networks, Inc. Application and network-based long poll request detection and cacheability assessment therefor
US8484314B2 (en) 2010-11-01 2013-07-09 Seven Networks, Inc. Distributed caching in a wireless network of content delivered for a mobile application over a long-held request
US8326985B2 (en) 2010-11-01 2012-12-04 Seven Networks, Inc. Distributed management of keep-alive message signaling for mobile network resource conservation and optimization
US9330196B2 (en) 2010-11-01 2016-05-03 Seven Networks, Llc Wireless traffic management system cache optimization using http headers
GB2499534B (en) 2010-11-01 2018-09-19 Seven Networks Llc Caching adapted for mobile application behavior and network conditions
GB2500327B (en) 2010-11-22 2019-11-06 Seven Networks Llc Optimization of resource polling intervals to satisfy mobile device requests
GB2495463B (en) 2010-11-22 2013-10-09 Seven Networks Inc Aligning data transfer to optimize connections established for transmission over a wireless network
GB2501416B (en) 2011-01-07 2018-03-21 Seven Networks Llc System and method for reduction of mobile network traffic used for domain name system (DNS) queries
WO2012145533A2 (en) 2011-04-19 2012-10-26 Seven Networks, Inc. Shared resource and virtual resource management in a networked environment
GB2504037B (en) 2011-04-27 2014-12-24 Seven Networks Inc Mobile device which offloads requests made by a mobile application to a remote entity for conservation of mobile device and network resources
WO2012149434A2 (en) 2011-04-27 2012-11-01 Seven Networks, Inc. Detecting and preserving state for satisfying application requests in a distributed proxy and cache system
US8930959B2 (en) 2011-05-13 2015-01-06 Orions Digital Systems, Inc. Generating event definitions based on spatial and relational relationships
CN102254246B (en) * 2011-06-17 2014-09-17 中国建设银行股份有限公司 Workflow managing method and system
US9483606B1 (en) 2011-07-08 2016-11-01 Consumerinfo.Com, Inc. Lifescore
WO2013015994A1 (en) 2011-07-27 2013-01-31 Seven Networks, Inc. Monitoring mobile application activities for malicious traffic on a mobile device
US9106691B1 (en) 2011-09-16 2015-08-11 Consumerinfo.Com, Inc. Systems and methods of identity protection and management
US8738516B1 (en) 2011-10-13 2014-05-27 Consumerinfo.Com, Inc. Debt services candidate locator
US9536264B2 (en) 2011-11-14 2017-01-03 Microsoft Technology Licensing, Llc Host agnostic messaging in a continuation based runtime
US8868753B2 (en) 2011-12-06 2014-10-21 Seven Networks, Inc. System of redundantly clustered machines to provide failover mechanisms for mobile traffic management and network resource conservation
US8934414B2 (en) 2011-12-06 2015-01-13 Seven Networks, Inc. Cellular or WiFi mobile traffic optimization based on public or private network destination
US9009250B2 (en) 2011-12-07 2015-04-14 Seven Networks, Inc. Flexible and dynamic integration schemas of a traffic management system with various network operators for network traffic alleviation
WO2013086447A1 (en) 2011-12-07 2013-06-13 Seven Networks, Inc. Radio-awareness of mobile device for sending server-side control signals using a wireless network optimized transport protocol
EP2792188B1 (en) 2011-12-14 2019-03-20 Seven Networks, LLC Mobile network reporting and usage analytics system and method using aggregation of data in a distributed traffic optimization system
US8861354B2 (en) 2011-12-14 2014-10-14 Seven Networks, Inc. Hierarchies and categories for management and deployment of policies for distributed wireless traffic optimization
US9832095B2 (en) 2011-12-14 2017-11-28 Seven Networks, Llc Operation modes for mobile traffic optimization and concurrent management of optimized and non-optimized traffic
US9042266B2 (en) 2011-12-21 2015-05-26 Kik Interactive, Inc. Methods and apparatus for initializing a network connection for an output device
GB2499306B (en) 2012-01-05 2014-10-22 Seven Networks Inc Managing user interaction with an application on a mobile device
WO2013116856A1 (en) 2012-02-02 2013-08-08 Seven Networks, Inc. Dynamic categorization of applications for network access in a mobile network
US9326189B2 (en) 2012-02-03 2016-04-26 Seven Networks, Llc User as an end point for profiling and optimizing the delivery of content and data in a wireless network
US9319283B2 (en) * 2012-02-27 2016-04-19 Xerox Corporation Systems and methods for creating web service compositions
US8812695B2 (en) 2012-04-09 2014-08-19 Seven Networks, Inc. Method and system for management of a virtual network connection without heartbeat messages
US10263899B2 (en) 2012-04-10 2019-04-16 Seven Networks, Llc Enhanced customer service for mobile carriers using real-time and historical mobile application and traffic or optimization data associated with mobile devices in a mobile network
US9853959B1 (en) 2012-05-07 2017-12-26 Consumerinfo.Com, Inc. Storage and maintenance of personal data
WO2014011216A1 (en) 2012-07-13 2014-01-16 Seven Networks, Inc. Dynamic bandwidth adjustment for browsing or streaming activity in a wireless network based on prediction of user behavior when interacting with mobile applications
US9161258B2 (en) 2012-10-24 2015-10-13 Seven Networks, Llc Optimized and selective management of policy deployment to mobile clients in a congested network to prevent further aggravation of network congestion
US9654541B1 (en) 2012-11-12 2017-05-16 Consumerinfo.Com, Inc. Aggregating user web browsing data
US9916621B1 (en) 2012-11-30 2018-03-13 Consumerinfo.Com, Inc. Presentation of credit score factors
US9307493B2 (en) 2012-12-20 2016-04-05 Seven Networks, Llc Systems and methods for application management of mobile device radio state promotion and demotion
US9241314B2 (en) 2013-01-23 2016-01-19 Seven Networks, Llc Mobile device with application or context aware fast dormancy
US8874761B2 (en) 2013-01-25 2014-10-28 Seven Networks, Inc. Signaling optimization in a wireless network for traffic utilizing proprietary and non-proprietary protocols
US8750123B1 (en) 2013-03-11 2014-06-10 Seven Networks, Inc. Mobile device equipped with mobile network congestion recognition to make intelligent decisions regarding connecting to an operator network
US9916133B2 (en) * 2013-03-14 2018-03-13 Microsoft Technology Licensing, Llc Software release workflow management
US10102570B1 (en) 2013-03-14 2018-10-16 Consumerinfo.Com, Inc. Account vulnerability alerts
US9406085B1 (en) 2013-03-14 2016-08-02 Consumerinfo.Com, Inc. System and methods for credit dispute processing, resolution, and reporting
US10880609B2 (en) 2013-03-14 2020-12-29 Comcast Cable Communications, Llc Content event messaging
US10685398B1 (en) 2013-04-23 2020-06-16 Consumerinfo.Com, Inc. Presenting credit score information
US9380126B2 (en) 2013-05-20 2016-06-28 International Business Machines Corporation Data collection and distribution management
US9065765B2 (en) 2013-07-22 2015-06-23 Seven Networks, Inc. Proxy server associated with a mobile carrier for enhancing mobile traffic management in a mobile network
US10146865B2 (en) * 2013-10-04 2018-12-04 Orions Digital Systems, Inc. Tagonomy—a system and method of semantic web tagging
US9477737B1 (en) 2013-11-20 2016-10-25 Consumerinfo.Com, Inc. Systems and user interfaces for dynamic access of multiple remote databases and synchronization of data based on user rules
US11783382B2 (en) 2014-10-22 2023-10-10 Comcast Cable Communications, Llc Systems and methods for curating content metadata
CN105740996A (en) * 2014-12-09 2016-07-06 航天信息股份有限公司 Collective tax declaration management system and method
CN104506496B (en) * 2014-12-10 2017-09-29 山大地纬软件股份有限公司 The method of near-realtime data increment distribution based on Oracle Streams technologies
US10394949B2 (en) * 2015-06-22 2019-08-27 Microsoft Technology Licensing, Llc Deconstructing documents into component blocks for reuse in productivity applications
US10338973B2 (en) * 2015-09-30 2019-07-02 The Mitre Corporation Cross-cloud orchestration of data analytics
US10360072B2 (en) 2015-09-30 2019-07-23 The Mitre Corporation Cross-cloud orchestration of data analytics for a plurality of research domains
US10465124B2 (en) 2016-02-08 2019-11-05 Red Leaf Resources, Inc. Internal friction control systems for hydrocarbonaceous subsiding bodies
CN109308290B (en) * 2018-08-06 2021-12-24 国网安徽省电力有限公司信息通信分公司 Efficient data cleaning and converting method based on CIM
US10880313B2 (en) 2018-09-05 2020-12-29 Consumerinfo.Com, Inc. Database platform for realtime updating of user data from third party sources
US11315179B1 (en) 2018-11-16 2022-04-26 Consumerinfo.Com, Inc. Methods and apparatuses for customized card recommendations
US11238656B1 (en) 2019-02-22 2022-02-01 Consumerinfo.Com, Inc. System and method for an augmented reality experience via an artificial intelligence bot
CN112241276B (en) * 2019-07-19 2022-04-22 华为技术有限公司 Equipment upgrading method and device
US11941065B1 (en) 2019-09-13 2024-03-26 Experian Information Solutions, Inc. Single identifier platform for storing entity data
CN117033143B (en) * 2023-10-08 2024-01-26 常州瑞阳液压成套设备有限公司 Intelligent monitoring data transmission system and method based on running state of big data

Family Cites Families (27)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1995003586A1 (en) 1993-07-21 1995-02-02 Persistence Software, Inc. Method and apparatus for generation of code for mapping relational data to objects
BR9407578A (en) * 1993-09-22 1996-07-16 At & T Corp Process for allowing real-time changes to be made by subscribers
US5694549A (en) * 1994-03-03 1997-12-02 Telescan, Inc. Multi-provider on-line communications system
US5694593A (en) 1994-10-05 1997-12-02 Northeastern University Distributed computer database system and method
US5794050A (en) * 1995-01-04 1998-08-11 Intelligent Text Processing, Inc. Natural language understanding system
US5819086A (en) * 1995-06-07 1998-10-06 Wall Data Incorporated Computer system for creating semantic object models from existing relational database schemas
US5875333A (en) 1996-06-07 1999-02-23 Electronic Data Systems Corporation Automatically generating source code for relating a dialog interface with a business object in a computing application
US6115712A (en) * 1996-07-12 2000-09-05 International Business Machines Corporation Mechanism for combining data analysis algorithms with databases on the internet
US5884304A (en) 1996-09-20 1999-03-16 Novell, Inc. Alternate key index query apparatus and method
US5873079A (en) 1996-09-20 1999-02-16 Novell, Inc. Filtered index apparatus and method
US5870739A (en) 1996-09-20 1999-02-09 Novell, Inc. Hybrid query apparatus and method
US6076051A (en) * 1997-03-07 2000-06-13 Microsoft Corporation Information retrieval utilizing semantic representation of text
US5890158A (en) 1997-03-31 1999-03-30 International Business Machines Corporation Method, apparatus, and program storage device for sharing objects with a network server and a database server using a common object model
US6038668A (en) 1997-09-08 2000-03-14 Science Applications International Corporation System, method, and medium for retrieving, organizing, and utilizing networked data
JPH11126209A (en) * 1997-10-23 1999-05-11 Toshiba Corp Information processor, its method and recording medium recorded with information processing program
US6029165A (en) 1997-11-12 2000-02-22 Arthur Andersen Llp Search and retrieval information system and method
WO1999057656A1 (en) 1998-05-01 1999-11-11 Citizen 1 Software, Inc. Method and apparatus for simultaneously accessing a plurality of dispersed databases
US6192364B1 (en) 1998-07-24 2001-02-20 Jarg Corporation Distributed computer database system and method employing intelligent agents
US6564263B1 (en) * 1998-12-04 2003-05-13 International Business Machines Corporation Multimedia content description framework
US6772180B1 (en) * 1999-01-22 2004-08-03 International Business Machines Corporation Data representation schema translation through shared examples
US6199195B1 (en) 1999-07-08 2001-03-06 Science Application International Corporation Automatically generated objects within extensible object frameworks and links to enterprise resources
EP1305746A2 (en) * 1999-08-20 2003-05-02 Black Pearl, Inc. Ontology-driven information system
US6466933B1 (en) * 1999-09-21 2002-10-15 International Business Machines Corporation Delayed delivery of query results or other data from a federated server to a federated client until such information is needed
US6581059B1 (en) * 2000-01-24 2003-06-17 International Business Machines Corporation Digital persona for providing access to personal information
US20020032677A1 (en) * 2000-03-01 2002-03-14 Jeff Morgenthaler Methods for creating, editing, and updating searchable graphical database and databases of graphical images and information and displaying graphical images from a searchable graphical database or databases in a sequential or slide show format
US6513059B1 (en) * 2000-08-24 2003-01-28 Cambira Corporation Adaptive collaborative intelligent network system
AU2002228739A1 (en) * 2000-10-27 2002-05-06 Entigen Corporation Integrating heterogeneous data and tools

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
BAYARDO ET AL.: 'InfoSleuth: Agent-based semantic integration of information in open and dynamic environment' PROCEEDINGS OF THE 1997 ACM SIGMOD INTERNATIONAL CONFERENCE 1997, pages 195 - 206, XP000730507 *
HUHNS M. N. ET AL.: 'Ontologies for agents' IEEE INTERNET COMPUTING vol. 1, no. 6, November 1997, pages 81 - 83, XP001152180 *
MENA E. ET AL.: 'OBSERVER: An approach for query processing in global information systems based on interoperation across pre-existing ontologies' PROCEEDINGS OF THE 1ST IFCIS INTERNATIONAL CONFERENCE June 1997, pages 14 - 25, XP010200745 *
See also references of EP1435047A2 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7373338B2 (en) 2003-03-18 2008-05-13 British Telecommunications Public Limited Company Access control to shared resources
CN102025652A (en) * 2010-06-04 2011-04-20 西本新干线股份有限公司 Service bus and message processing method

Also Published As

Publication number Publication date
AU2002258640B2 (en) 2006-11-16
CN1526106A (en) 2004-09-01
US20020194181A1 (en) 2002-12-19
CN1333362C (en) 2007-08-22
WO2002077904A3 (en) 2004-05-06
EP1435047A2 (en) 2004-07-07
EP1435047A4 (en) 2006-08-02
US6847974B2 (en) 2005-01-25

Similar Documents

Publication Publication Date Title
US6847974B2 (en) Method and apparatus for intelligent data assimilation
AU2002258640A1 (en) Method and apparatus for intelligent data assimilation
US20200117486A1 (en) Method and Apparatus for Composite User Interface Creation
US9654429B2 (en) Method and apparatus for composite user interface generation
JP3274131B2 (en) Object-oriented point-to-point communication method and communication device for performing the method
US8924408B2 (en) Automatic generation of database invocation mechanism for external web services
US20030191769A1 (en) Method, system, and program for generating a program capable of invoking a flow of operations
US8219970B2 (en) XML push and remote execution of a wireless applications
US20020147745A1 (en) Method and apparatus for document markup language driven server
EP1390861A2 (en) Service provision system and method
JP2004511034A (en) System and method for integrating heterogeneous networks used in electronic communications and electronic commerce
EP1665039A2 (en) Dynamic program module loading system and method
EP1896939A2 (en) Data centric workflows
US20060136489A1 (en) Mapping a semantic model of business collaboration to a web services meta model
US6917930B1 (en) Database integrity in an internet e-commerce environment
CN114254606A (en) Microservice framework model
Huang et al. Building a web-based federated simulation system with Jini and XML
US7606785B1 (en) Concurrent a-box and t-box generation for use in web ontology language (OWL) database and knowledge base construction
Uhl Architectures enabling scalable Internet search
JavaBeans et al. XUEQIN HUANG XML-Based Messaging in the JSIM Simulation Environment (Under the Direction of JOHN A. MILLER)
WO2002075474A2 (en) Computer application framework

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ OM PH PL PT RO RU SD SE SG SI SK SL TJ TM TN TR TT TZ UA UG UZ VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
WWE Wipo information: entry into national phase

Ref document number: 2002258640

Country of ref document: AU

WWE Wipo information: entry into national phase

Ref document number: 01637/DELNP/2003

Country of ref document: IN

WWE Wipo information: entry into national phase

Ref document number: 2002728596

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 028091132

Country of ref document: CN

REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

WWP Wipo information: published in national office

Ref document number: 2002728596

Country of ref document: EP

NENP Non-entry into the national phase

Ref country code: JP

WWW Wipo information: withdrawn in national office

Ref document number: JP