Abstract

This is a procedure for determining text relevancy and can be used to enhance the retrieval of text documents by search queries. This system helps a user intelligently and rapidly locate information found in large textual databases. A first embodiment determines the common meanings between each word in the query and each word in the document. Then an adjustment is made for words in the query that are not in the documents. Further, weights are calculated for both the semantic components in the query and the semantic components in the documents. These weights are multiplied together, and their products are subsequently added to one another to determine a real value number (similarity coefficient) for each document. Finally, the documents are sorted in sequential order according to their real value number from largest to smallest value. Another, embodiment is for routing documents to topics/headings (sometimes referred to as filtering). Here, the importance of each word in both topics and

Document Type

Patent

Patent Number

US 5,576,954

Application Serial Number

08/148,688

Issue Date

11-19-1996

Current Assignee

UCFRF

Assignee at Issuance

UCFRF

College

College of Engineering and Computer Science (CECS)

Department

Computer Science

Allowance Date

7-12-1995

Filing Date

11-5-1993

Assignee at Filing

UCFRF

Filing Type

Nonprovisional Application Record

Donated

no

Share

COinS