University of California, San Diego 1984 - 1990
Lecturer, Instructor
Northrop Grumman Corporation 1988 - 1990
Software Engineer
San Diego Mesa College 1984 - 1990
Instructor
Boeing 1984 - 1990
Senior Technical Fellow
Education:
San Diego State University 1986 - 1989
San Diego State University 1984 - 1988
Master of Science, Masters, Computer Science
Elementary
National Chengchi University
Bachelors, Bachelor of Arts, Philosophy
Chinese Culture University
Doctorates, Masters, Master of Arts, Doctor of Philosophy, Philosophy
Skills:
Natural Language Processing Text Mining Artificial Intelligence Computer Science Systems Engineering Software Engineering Text Analytics Information Retrieval Machine Learning Data Mining System Architecture Knowledge Management Social Network Analysis Algorithms Aerospace Program Management Software Development Programming Distributed Systems Big Data Engineering Management Human Computer Interaction Data Analysis Link Analysis Simulations Visual Analytics Predictive Analytics Earned Value Management System Design Configuration Management Requirements Management Technical Leadership Object Oriented Design Perl Software Design High Performance Computing Semantic Web Image Processing Optimization Scalability Operations Research Avionics Hadoop Algorithm Design Lisp Enterprise Architecture Mathematical Modeling Space Systems
Fredrick Baden Holt - Seattle WA Anne Shu-Wan Kao - Bellevue WA Daniel John Pierce - Woodinville WA Stephen Robert Poteet - Bellevue WA Yuan-Jye Wu - Issaquah WA
Assignee:
The Boeing Company - Seattle WA
International Classification:
G06F 1700
US Classification:
706 45, 706 12, 706 46
Abstract:
Methods, apparatus and computer program products are provided for retrieving information from a text data collection and for classifying a document into none, one or more of a plurality of predefined classes. In each aspect, a representation of at least a portion of the original matrix is projected into a lower dimensional subspace and those portions of the subspace representation that relate to the term(s) of the query are weighted following the projection into the lower dimensional subspace. In order to retrieve the documents that are most relevant with respect to a query, the documents are then scored with documents having better scores being of generally greater relevance. Alternatively, in order to classify a document, the relationship of the document to the classes of documents is scored with the document then being classified in those classes, if any, that have the best scores.
Text Differentiation Methods, Systems, And Computer Program Products For Content Analysis
Anne S-W Kao - Bellevue WA, US Stephen R. Poteet - Bellevue WA, US Yuan-Jye Wu - Issaquah WA, US William Ferng - Sammamish WA, US Robert E. Cranfill - Seattle WA, US
Assignee:
The Boeing Company - Chicago IL
International Classification:
G06N 5/00
US Classification:
706 45, 706 47
Abstract:
Provided are improved methods, apparatus, and computer program products for text differentiation which involves identifying differences between documents with similar content, not merely similar terms, and generating results. Text differentiation provides the ability to find non-similar, or different, content hidden within documents with similar overall content, but not exactly the same content. Text differentiation may be used to quickly identify key differences between similar documents.
Anne S-W Kao - Bellevue WA, US Stephen R. Poteet - Bellevue WA, US Yuan-Jye Wu - Issaquah WA, US William Ferng - Sammamish WA, US Robert E. Cranfill - Seattle WA, US
Assignee:
The Boeing Company - Chicago IL
International Classification:
G06F 7/00 G06F 17/30
US Classification:
707736, 707772, 707917, 715254
Abstract:
A text summarizer identifies relevant terms in a document, weights the terms and extracts one or more segments to produce a summary or abstract. The various terms in a particular are weighted in relation to an existing document collection. A term weight computer computes term weights for terms in the document, and a threshold comparator compares the term weights to determine if the corresponding terms are relevant to the document collection. Next, a term weight summer adds the term weights for each occurrence of each relevant term in the various segments of the document, and a summation comparator compares the summations to identify a text summarization segment representative of the document. Optionally, relevant terms can be highlighted in the term summarization segment.
Text Summarization Method And Apparatus Using A Multidimensional Subspace
Yuan-Jye Wu - Issaquah WA, US Anne S-W Kao - Bellevue WA, US Stephen R. Poteet - Bellevue WA, US William Ferng - Sammamish WA, US Robert E. Cranfill - Seattle WA, US
Assignee:
The Boeing Company - Chicago IL
International Classification:
G06F 7/00 G06F 17/30
US Classification:
707736, 707772, 707917, 715254
Abstract:
A text summarizer identifies relevant terms in a document, weights the terms and extracts one or more segments to produce a summary or abstract. The various terms in a particular are weighted in relation to an existing document collection. A term weight computer computes term weights for terms in the document, and a threshold comparator compares the term weights to determine if the corresponding terms are relevant to the document collection. Next, a term weight summer adds the term weights for each occurrence of each relevant term in the various segments of the document, and a summation comparator compares the summations to identify a text summarization segment representative of the document. Optionally, relevant terms can be highlighted in the term summarization segment.
Streaming Text Data Mining Method And Apparatus Using Multidimensional Subspaces
Yuan-Jye Wu - Issaquah WA, US Anne S-W Kao - Bellevue WA, US Stephen R. Poteet - Bellevue WA, US William Ferng - Sammamish WA, US Robert E. Cranfill - Seattle WA, US
Assignee:
The Boeing Company - Chicago IL
International Classification:
G06F 7/00 G06F 17/00
US Classification:
707737, 707736, 707739, 706 45
Abstract:
A streaming text data comparator performs real-time text data mining on streaming text data. The comparator receives a streaming text data document and generates a vector representation of the term frequencies relating to an existing document collection. The comparator then transforms the term frequency vector into a projection in a precomputed multidimensional subspace that represents the original document collection. The comparator further calculates a relationship value representing the similarities or differences between the vector representation and the subspace, and compares the relationship value to a predetermined threshold to determine whether the streaming text data document is related to the original document collection. If the streaming text data document is related, the streaming text data comparator intercalates the new document into the document collection. If the new document is not related, the comparator may store or delete the unrelated document.
Anne S-W Kao - Bellevue WA, US Stephen R. Poteet - Bellevue WA, US Nicholas J. Multari - Snoqualmie WA, US
Assignee:
The Boeing Company - Chicago IL
International Classification:
H04L 29/06
US Classification:
726 27
Abstract:
A system generates rules for classifying documents are generated by building a vocabulary of features (e. g. , words, phrases, acronyms, etc. ) that are related to classifying concepts. The system includes a security document reader receives a security document that defines security concepts for a particular project and parses the security document to separate the security concepts. A vocabulary builder receives samples provided by the user that contain information related to the project. For each security concept, the vocabulary builder uses statistical analysis techniques to find features in the samples that are related to that concept. A rule generation assistant, for each security concept, generates rules based on the built vocabulary and the samples. The rule generation assistant uses statistical analysis techniques on the vocabulary and samples to determine features that optimally predict a particular concept. The rules can be used by a downgrader to process information to be distributed.
Automated Analysis And Summarization Of Comments In Survey Response Data
Stephen R. Poteet - Bellevue WA, US Anne Kao - Bellevue WA, US Shan Luh - Kenmore WA, US
Assignee:
The Boeing Company - Chicago IL
International Classification:
G06F 17/30
US Classification:
707737, 707738, 707776
Abstract:
Technologies are described herein for providing automated analysis and summarization of free-form comments in survey response data. A number of topic words are identified from the survey response comments, and a numeric weight is calculated for each topic word that reflects the relevance of the topic word to each comment. Each topic word is associated with one or more topics and the comments relevant to each topic is then determined based on the weights of the associated topic words in each comment. A report is generated which summarizes the topics and their relative importance in the survey response comments based upon the number of comments relevant to each.
Methods And Framework For Constraint-Based Activity Mining (Cmap)
A method of mining data to discover activity patterns within the data is described. The method includes receiving data to be mined from at least one data source, determining which of a number of specified interests and constraints are associated with the mining process, selecting corresponding mining agents that combine search algorithms with propagators from the specified constraints, and finding any activity patterns that meet the specified interests and constraints.