Am Tage 2000-11-13 gab der Konzern CGMI, der Altavista und 12 weitere Firmen besitzt, in einer stark beworbenen
Pressemitteilung bekannt, dass er 38 Patente auf wesentliche Elemente der Netzindexierung besitzt. Zu diesem Zeitpunkt schrieb CGMI rote Zahlen und versuchte, bei Investoren um Zuversicht zu werben. Am Tage 2000-01-15 erklärte der Chef von CGMI in einem
Interview darüber hinaus, dass er die Verletzer seiner Patente in Kürze zu verfolgen gedenke und dass praktisch jeder, der das Internet oder auch nur sein firmeninternes Intranet absuche und indexiere, zu diesen Verletztern gehöre:
- Internet World:
- Can we talk a bit about some of the ideas or opportunities that you backed off of because money was an issue this past year?
- David Wetherell:
- [...]
Even though AltaVista s doing well in the advertising space, we just think that in order to really ensure strong growth they ought to leverage their position in search licensing to a greater extent. And we saw the opportunity to do that because we think it s a big market. They happen to own 38 patents, many of which we think are fundamental in the search area. They were the first to spider and index the Web. And Digital did a good job of recognizing the potential value of that intellectual property. And they were very thorough in filing broad and deep and narrow patents. And we have another 30 patents that are in application. So we believe that virtually everyone out there who indexes the Web is in violation of at least several of those key patents.
- IW:
- Does that mean you ll pursue that?
- DW:
- Yes, we will. Coming up in the first quarter of 2001.
- IW:
- So we may see some lawsuits ...
- DW:
- If necessary, we will defend it, to the letter of the law.
- IW:
- Are there any specific examples of the types of patents?
- DW:
- If you index a distributed set of databases what the Internet is and even within intranets, corporations, that s one of the patents. We did a press release on this with a list of six or ten of the key areas that the patents cover.
Die meisten europäischen Anträge warten beim EPA noch auf ihre Erteilung.
US 5864863
- I claim
- A system for indexing Web pages of the Internet, the pages stored in computers connected to each other by a communications network, each page having a unique URL (universal record locator), some of the pages including URL links to other pages, comprising: a communication interface for fetching a batch of specified pages of the Web from the computers in accordance with the URLs and URL links;
a parser sequentially partitioning the batch of specified pages into indexable words, each word representing a portion of one specified page or an attribute of one or more portions of the specified page, the parser sequentially assigning locations to the words as they are parsed;
a memory storing index entries, each index entry including a word entry representing a unique one of the words, and one or more location entries indicating where the unique word occurs in the Web;
a query module parsing a query into terms and operators relating the terms;
a search engine using object-oriented stream readers to sequentially read location of specified index entries, the specified index entries corresponding to the terms of a query;
and a display module for presenting qualified pages located by the search engine to users of the Web.
US 5974455
- A system for locating Web pages stored on remotely located computers, each Web page having a unique URL (universal resource locator), at least some of said Web pages including URL links to other ones of the Web pages, the system comprising:
- a communications interface for fetching specified ones of the Web pages from said remotely located computers in accordance with corresponding URLs;
- a Web information file having a set of entries, each entry denoting, for a corresponding Web page, a URL and fetch status information;
- a Web information table, stored in RAM (random access memory), having a set of entries, each entry denoting a fingerprint value and fetch status information for a corresponding Web page; and
- a Web scooter procedure, executed by the system, for fetching and analyzing Web pages, said Web scooter procedure including instructions for fetching Web pages whose Web information file entries meet predefined selection criteria based on said fetch status information, for determining for each URL link in each received Web page whether a corresponding entry already exists in the Web information table, and for each URL link which does not have a corresponding entry in the Web information table adding a new entry in the Web information table and a corresponding new entry in the Web information file.
EP 444358 = US 5495608
- Dynamic optimization of a single relation access.
EP 458698 = US 5276868
- Method and apparatus for pointer compression in structured
EP 520459 = US 5347653
- A method and apparatus for indexing and retrieval of object versions in a versioned data base.
EP 522363 = US 5204958
- System and method for efficiently indexing and storing large database with high-data insertion frequency.
EP 551243 = US 5519858
- Address recognition engine.
EP 567355 (Kombination aus 5 US-Anmeldungen)
- A method and apparatus for operating a multiprocessor computer system having cache memories.
EP 886227
- Full-text indexed mail repository
EP 886228
- WWW-based mail service system
US 5226150
- Apparatus for suppressing an error report from an address for which an error has already been reported
US 5276872
- Concurrency and recovery for index trees with nodal updates using multiple atomic actions by which the trees integrity is preserved during u
US 5276874
- Method for creating a directory tree in main memory using an index file in secondary memory
US 5394143
- Run-length compression of index keys
US 5506984
- Method and system for data retrieval in a distributed system using linked location references on a plurality of nodes
US 5553258
- Method and apparatus for forming an exchange address for a system with different size caches
US 5671406
- Data structure enhancements for in-place sorting of a singly linked list
US 5717921
- Concurrency and recovery for index trees with nodal updates using multiple atomic actions
US 5745890
- Sequential searching of a database index using constraints on word-location pairs
US 5745894
- Method for generating and searching a range-based index of word-locations
US 5745898
- Method for generating a compressed index of information of records of a database
US 5745899
- Method for indexing information of a database
US 5745900
- Method for indexing duplicate database records using a full-record fingerprint
US 5765158
- Method for sampling a compressed index to create a summarized index
US 5765168
- Method for maintaining an index
US 5787435
- Method for mapping an index of a database into an array of files
US 5794242
- Temporally and spatially organized database
US 5797008
- Memory storing an integrated index of database records
US 5809502
- Object-oriented interface for an index
US 5829051
- Apparatus and method for intelligent multiple-probe cache allocation
US 5832500
- Method for searching an index
US 5852820
- Method for optimizing entries for searching an index
US 5864863
- Method for parsing, indexing and searching world-wide-web pages
US 5915251
- Method and apparatus for generating and searching range-based index of word locations
US 5953747
- Apparatus and method for serialized set prediction
US 5956758
- Method for determining target address of computed jump instructions in executable programs
US 5963954
- Method for mapping an index of a database into an array of files
US 5966710
- Method for searching an index
US 5966735
- Array index chaining for tree structure save and restore in a process swapping system
US 5970497
- Method for indexing duplicate records of information of a database
US 5987544
- System interface protocol with optional module cache
US 6016493
- Method for generating a compressed index of information of records of a database
US 6021409
- Method for parsing, indexing and searching world-wide-web pages
US 6029164
- Method and apparatus for organizing and accessing electronic mail messages using labels and full text and label indexing
US 6047286
- Method for optimizing entries for searching an index
US 6067543
- Object-oriented interface for an index
US 6078923
- Memory storing an integrated index of database records
US 6092101
- Method for filtering mail messages for a plurality of client computers connected to a mail service system
US 6105019
- Constrained searching of an index
US 6108770
- Method and apparatus for predicting memory dependence using store sets
US 6112203
- Method for ranking documents in a hyperlinked environment using connectivity and selective content analysis
US 6138113
- Method for identifying near duplicate pages in a hyperlinked database