[Prev][Next][Index]
CSLI Calendar, 21 September 1989, Vol 5:1
-
Subject: CSLI Calendar, 21 September 1989, Vol 5:1
-
From: csli@csli.stanford.edu
-
Date: Wed 20 Sep 1989 17:06:41
C S L I C A L E N D A R O F P U B L I C E V E N T S
_____________________________________________________________________________
21 September 1989 Stanford Vol. 5, No. 1
_____________________________________________________________________________
A weekly publication of the Center for the Study of Language and
Information (CSLI), Ventura Hall, Stanford University, Stanford, CA 94305-4115
____________
CSLI ACTIVITIES FOR NEXT THURSDAY, 28 September 1989
12:00 p.m. TINLunch
Cordura Hall Document Image Analysis by Tree Structure Model
Conference Room Y. Nishimura, T. Takahashi, and Y. Kobayashi
ATR Communication Systems Research Laboratories
(nisimura@atr-sw.atr.co.jp)
Abstract below
2:15 p.m. CSLI Seminar
Cordura Hall Models of Rational Agency
Conference Room First meeting
Michael Bratman, Martha Pollack, Stan Rosenschein
(bratman@csli, pollack@warbucks.ai.sri.com,
stan@teleos.com)
Abstract below
____________
ANNOUNCEMENT
This is the first Calendar of the 1989-90 academic year, and we'd like
to welcome everyone back to CSLI's weekly activities, starting next
Thursday.
We would like to thank Emma Pease for having done a great job as
Editor of the Calendar for the last few years. Emma has taken on new
responsibilities within CSLI, and, starting with this issue, the
Calendar will be edited by Ingrid Deiwiks.
____________
NEXT WEEK'S TINLUNCH
Document Image Analysis by Tree Structure Model
Y. Nishimura, T. Takahashi, and Y. Kobayashi
(nisimura@atr-sw.atr.co.jp)
28 September 1989
In this presentation, I will first outline the organization of ATR
(Advanced Telecommunication Research Institute), which is located in
Kyoto, Japan, and consists of four laboratories:
- ATR Communication Systems Research Laboratories
- ATR Optical and Radio Communications Research Laboratories
- ATR Interpreting Telephony Research Laboratories
- ATR Auditory and Visual Perception Research Laboratories
Then, I will introduce the research conducted in our group at ATR
Communication Systems Research Laboratories. Finally, I will talk
about my research concerning next-generation page readers (OCRs), an
abstract of which is given below.
We propose a document image analysis method that can extract the
logical structure of scanned paper documents to obtain indices such as
titles, author names, etc. The aim of this research is to develop a
building block for next-generation page readers, which will be able to
capture not only the character codes, but also the layout and logical
structure of documents.
A major problem in document image analysis is segmenting document
images to extract the components (indices). We implement the
segmentation process as the top-down, model-driven matching process of
the model and the input image. For this purpose, we introduce a tree
structure model to represent the layout of each document type (such as
title pages of IEEE Trans. papers). The model also describes elements
of the page such as the body and running heads. The body is further
divided into the text and other components.
Using the model in a top-down fashion, first the running heads and
running foots are extracted. Next, the components other than the text
are extracted. The model describes these components in the way they
are segmented, thus the segmentation result of the input image matches
that of the model. The text, the part where uniformity is frequently
degraded, is extracted as the remaining part of the page.
We also introduce a model-building process that can be operated by a
novice user. In an experiment using 115 input documents from 38 types
of scientific paper title pages, every index in 85.2% of the input
documents was correctly extracted. In a comparison experiment using a
bottom-up segmentation process, the correct extraction rate was only
9.1%.
Applications of the method include document entry without rekeying
for electronic publishing systems, document retrieval, and automatic
indexing of scanned documents for document database systems.
____________
NEXT WEEK'S SEMINAR
Models of Rational Agency
Michael Bratman, Martha Pollack, Stan Rosenschein
First meeting
28 September 1989
This seminar will survey models of intelligent rational agents in
dynamic, multiagent environments. We will look at models couched in
terms of commonsense psychological notions like belief, desire,
intention, and plan. We will explore the relations between such
models and issues of implementation and design as they arise in
artificial intelligence. In particular, we will be concerned with the
implications of resource limitations for models of rational agency at
various levels of abstraction.