
[Date Prev][Date Next][Date Index]
CSLI Calendar, Wednesday, 28 April 2004, vol. 19:33
CSLI CALENDAR OF PUBLIC EVENTS
______________________________________________________________________
28 April 2004 Stanford Vol. 19, No. 33
______________________________________________________________________
A weekly publication of the
Center for the Study of Language and Information (CSLI)
Stanford University, Cordura Hall, Stanford, CA 94305-4115
http://www-csli.stanford.edu/
____________
ACTIVITIES FROM 28 April 2004 TO 7 MAY 2004
WEDNESDAY, 28 APRIL 2004
4:15pm CSLI Seminar on Computational Learning and Adaptation
Cordura Hall, room 100
"Constructing Features from High-Dimensional Data"
Geoffrey J. Gordon
Center for Automated Learning and Discovery, Carnegie Mellon
http://cll.stanford.edu/scla.html
Abstract below
4:15pm EE380: Computer Systems Lab Colloquium
Gates B03 (NEC Auditorium)
"Flexible Number Representations for Computing with FPGAs"
Oskar Mencer
Imperial College and Maxeler Technologies
http://ee380.stanford.edu/contents.html
Abstract below
4:30pm Psychology Talk
Jordan Hall 420:245
"The basal ganglia and processing beat-based rhythm in
musicians and non-musicians"
Jessica Grahn
MRC Cognition and Brain Sciences Unit, UK
Abstract below
THURSDAY, 29 APRIL 2004
12 noon Psychology Developmental Brownbags
Bldg. 420:050
"On-line processing of grammatically correct and anomalous
sentences in toddlers
Renate Zangl
Psychology, Stanford
http://www-psych.stanford.edu/news.html#dev_brownbag
12 noon RNI/Stanford Seminar Series on Theoretical Neuroscience
BioX/Clark Center, Room S360
"Inferential Processes and the Architecture of Visual Cortex"
Malcolm Young
University of Newcastle
http://www.ncl.ac.uk/biol/staff/profile/m.p.young
http://www.rni.org/seminar2.html
Abstract below
3:00pm Reuters Foundation Digital Vision Program Seminar
Reuters lounge, Cordura Hall
"Business Incubation - A Platform for Entrepreneur and ICT
Growth in Developing Countries"
Barbara L. Harley
http://cgi.stanford.edu/group/reuters/cgi-bin/calendar/index.cgi
4:00pm Personality Lab
Jordan Hall 420:419
"The body stripped down: An existential account of ambivalence
toward the physical body"
Jamie Goldenberg
UC Davis
http://www-psych.stanford.edu/news.html#person_lab
4:00pm SRI AI Seminar Series
EJ228, SRI International
"Towards an Interoperability Framework for Collaborative Tools"
Eugene Eric Kim
Blue Oxen Associates
http://www.ai.sri.com/seminars/
Abstract below
4:00pm UC Berkeley CIS Seminar
Soda Hall 310 (UC Berkeley)
"G Source Separation"
Hagai Attias
http://www.eecs.berkeley.edu/~ywteh/cis-seminar
Abstract below
4:15pm SSP10: Symbolic Systems Forum
Bldg. 380:380C (Math Corner)
"How Real Is the Future?"
John Perry
Philosophy, Stanford University
http://symsys.stanford.edu:8081/ssp-dynamic/servlet/ssp_events
Abstract below
4:15pm Information Systems Seminar
Packard 101
"Dynamic Power Control in a Wireless Channel Subject to a
Quality of Service Constraint"
Baris Ata
Kellogg School of Management
http://isl.stanford.edu/groups/seminar/
Abstract below
5:15pm Archaeology Research Workshop
Bldg 60:61H
"Genes vs agents:
a discussion of the widening theoretical gap in archaeology"
Kristian Kristiansen
Gothenburg University and the Stanford Archaeology Center
http://archaeology.stanford.edu/workshop.html
FRIDAY, 30 APRIL 2004
11:00am CS545: Database Seminar
Gates 104
"Active XML: A Data-Centric Perspective on Web Services"
Omar Benjelloun
INRIA-Futurs
http://www-db.stanford.edu/dbseminar/
Abstract below
(special talk)
12 noon Ethics@Noon
Bldg. 100:101K
"Ethical Writing: Borrowing, Stealing, and Owning"
Hilton Obenzinger
Undergraduate Research Programs and Department of English
http://www.stanford.edu/dept/EIS/lectures_ethics.html
2:00pm NLP Reading Group
Ventura 17
"Speed and Accuracy in Shallow and Deep Stochastic Parsing"
Ron Kaplan, Stefan Riezler, Tracy Holloway King, John T. Maxwell III,
Alexander Vassermann, and Richard Crouch (PARC)
http://www.parc.com/istl/members/riezler/PAPERS/NAACL04.pdf
http://www-nlp.stanford.edu/nlp/nlpgroup.html
Abstract below
3:15pm The Mathematical Representation of Nature
Location to be announced
"Beauty doth of itself persuade:
Quantization, mathematical beauty, and scientific understanding"
Michael Dickson
Indiana University
http://www.stanford.edu/dept/HPST/philosophyworkshop.html
3:15pm Friday Cognitive Seminar
Jordan Hall 420:050
Title to be announced
Ching Kao
http://www-psych.stanford.edu/news.html#frisem
3:30pm Linguistics Department Colloquium
Margaret Jacks Hall 460:126
"Narrativity and Differentiation:
Reading "between the lines" in the Cold War"
Susan Gal
University of Chicago
http://www-linguistics.stanford.edu/colloq/
Abstract below
SATURDAY, 1 MAY 2004
10:00am The Mathematical Representation of Nature
Terrace Room, 4th floor, Margaret Jacks Hall
"A case of mathematical uncooperativeness:
Classical field theories and the classical ideal of theories"
Mathias Frisch
University of Maryland
http://www.stanford.edu/dept/HPST/philosophyworkshop.html
2:00pm The Mathematical Representation of Nature
Terrace Room, 4th floor, Margaret Jacks Hall
"Hume and mechanics' complexities"
Mark Wilson
University of Pittsburgh
http://www.stanford.edu/dept/HPST/philosophyworkshop.html
MONDAY, 3 MAY 2004
12 noon UC Berkeley Developmental Psychology Colloquium
3105 Tolman (Berkeley)
"The Child Learns to Think for Speaking:
Puzzles of Crosslinguistic Diversity in Form-Meaning Mappings"
Dan Slobin
Psychology, UC Berkeley
http://psychology.berkeley.edu/admin/colloquia.html
4:15pm CS528: Broad Area Colloquium in AI,
Geometry, Graphics, Robotics, and Vision
TCSeq 200
"Differential topology and combinatorial algorithms, and where
this unlikely marriage works"
Herbert Edelsbrunner
Duke
http://graphics.stanford.edu/courses/cs528/
Abstract below
TUESDAY, 4 MAY 2004
2:45pm CS548: Internet and Distributed Systems Seminar
Skilling Auditorium
"High Availability Techniques and Case Studies from USBank"
Greg Messer
Manager, Enterprise Performance Engineering,
http://cs548.stanford.edu/schedule.shtml
Abstract below
3:00pm Reuters Foundation Digital Vision Program Seminar
Reuters lounge, Cordura Hall
"Democracy: Coming Soon to a Computer Near You"
Todd Davies
http://cgi.stanford.edu/group/reuters/cgi-bin/calendar/index.cgi
4:15pm SNRC Industry Seminar
Gates B03
"Mars Rover Information Portal"
Ronald Mak
NASA Ames Research Center
http://snrc.stanford.edu/events/industry-seminar/
Abstract below
WEDNESDAY, 5 MAY 2004
12 noon UC Berkeley Anthropology Talks
Seminar Room, 2251 College, ARF (Berkeley)
"Learning and stone tool production in Paleolithic France"
Kathy Sterling
http://anthropology.berkeley.edu/news.html
3:45pm Psychology Department Colloquium
Jordan Hall 420:041
Title to be announced
Ken Kendler
Virginia Commonwealth University
http://www-psych.stanford.edu/news.html
4:15pm EE380: Computer Systems Lab Colloquium
Gates B03 (NEC Auditorium)
"Scalable Visual Comparison of Biological Trees and Sequences"
Tamara Munzner
University of British Columbia
http://ee380.stanford.edu/contents.html
Abstract below
THURSDAY, 6 MAY 2004
12:15pm CSLI CogLunch
Cordura Hall, Room 100
Title to be announced
Paul Kay
http://www-csli.stanford.edu/events/Coglunch/
3:00pm Reuters Foundation Digital Vision Program Seminar
Reuters lounge, Cordura Hall
"Designing a Sea Change"
Lauralee Alben
http://cgi.stanford.edu/group/reuters/cgi-bin/calendar/index.cgi
4:00pm Personality Lab
Jordan Hall 420:419
"'I just can't stop thinking about it...': Information-processing
biases, cognitive inhibition, and rumination in depression"
Jutta Joormann,
Psychology, Stanford
http://www-psych.stanford.edu/news.html#person_lab
4:00pm SRI AI Seminar Series
EJ228, SRI International
Title to be announced
Elin Ronby Pedersen
http://www.ai.sri.com/seminars/
4:15pm SSP10: Symbolic Systems Forum
Bldg. 380:380C (Math Corner)
"Engineering Deep Grammars"
Tracy King
Palo Alto Research Center
Consulting Associate Professor, Symbolic Systems Program
http://symsys.stanford.edu:8081/ssp-dynamic/servlet/ssp_events
Abstract below
4:15pm Information Systems Seminar
Packard 101
"Time Synchronization and Distributed Modulation in Large
Sensor Networks (or 'How to Make a Large Random Field by
Stitching Together Very Many Small Ones')"
Sergio D. Servetto
Cornell University
http://isl.stanford.edu/groups/seminar/
Abstract below
5:30pm Stanford Phonology Workshop
Margaret Jacks Hall 460:126
"A French Dialect Continuum Across North America"
Luc Baronian
Stanford University
http://www-linguistics.stanford.edu/pinterest/
Abstract below
FRIDAY, 7 MAY 2004
12:30pm CS547: Human-Computer Interaction Seminar
Gates B01
"Understanding the Requirements for Developing and Designing
Free/Open Source Software"
Walt Scacchi
UC Irvine
http://www-pcd.stanford.edu/cs547/
Abstract below
2:00pm NLP Reading Group
Ventura 17
"Unsupervised multilingual sentence boundary disambiguation"
Jan Strunk
Linguistics, Stanford
http://www-nlp.stanford.edu/nlp/nlpgroup.html
Abstract below
3:15pm Philosophy Department Colloquium
Bldg. 90:92Q
Title to be announced
Dorothea Frede
University of Hamburg/Stanford CASBS
http://www-philosophy.stanford.edu/ce.html
3:15pm Friday Cognitive Seminar
Jordan Hall 420:050
Title to be announced
Michelle Gumbrecht
http://www-psych.stanford.edu/news.html#frisem
3:30pm Semantics and Pragmatics Workshop
Margaret Jacks Hall 460:126
Title to be announced
Mark Baker
Rutgers
http://www-linguistics.stanford.edu/semgroup/
____________
Stanford Blood Center status: Shortage of A+, O-, and O+. For an
appointment: http://bloodcenter.stanford.edu/ or call 650-723-7831.
It only takes an hour of your time.
____________
CSLI SEMINAR ON COMPUTATIONAL LEARNING AND ADAPTATION
on Wednesday, 28 April 2004, 4:15pm to 5:30pm
Cordura 100
http://cll.stanford.edu/scla.html
"Constructing Features from High-Dimensional Data"
Geoffrey J. Gordon
Center for Automated Learning and Discovery, Carnegie Mellon
An important machine learning problem is feature construction:
learning functions which reduce raw, high-dimensional data to a
lower-dimensional set of features, while still retaining as much
information from the raw data as possible. This sort of dimensionality
reduction can help filter out noise, and it can be an important
pre-processing step before running learning algorithms whose
computational complexity or sample complexity grow quickly with input
dimensionality. In this talk, I will describe our experiments with
feature construction from data recorded using our team of mobile
robots. These algorithms include nonlinear principal components
analysis, which we have used to compress belief states in planning
problems, and predictive state representations, which we have used to
compress time series of laser rangefinder readings. While our
experiments are based on robot sensor data, the algorithms and results
should be relevant to other applications of dimensionality reduction
such as text, link structure, or images.
____________
EE380: COMPUTER SYSTEMS LAB COLLOQUIUM
on Wednesday, 28 April 2004, 4:15pm to 5:30pm
NEC Auditorium (B03), Gates Computer Science Building
http://ee380.stanford.edu/contents.html
"Flexible Number Representations for Computing with FPGAs"
Oskar Mencer
Imperial College and Maxeler Technologies
As programmable hardware matures into a competitive platform for
computation, we are faced with the opportunity to optimize the number
representations possibly for each variable of our programs.
Dynamically, we can even adapt the number representation to input data
and phases of the program.
Adapting the number representation consists of two parts: (1) the
encoding, and (2) the number of bits used to represent the values. We
are here not limited to standard encodings such as integer or IEEE
floating point, but can adapt the encoding to the function that is
being computed. The second step is then to find the precision
requirement which we can accomplish, for example, via automatic
differentiation of the program.
This research is enabled by ASC, A Stream Compiler, which provides a
hardware programming environment for FPGAs. The programmer gets access
to the programmable hardware on the algorithm level, but also on the
architecture, arithmetic and gate levels, all within a single C++
program. As a consequence, ASC combines the performance of hand
designed circuits with the productivity of the software development
process.
About the speaker: Oskar Mencer teaches at Imperial College in London
and, on the side, runs Maxeler Technologies, a startup commercializing
ASC technology. (See http://www.maxeler.com for information about
Maxeler Technologies.)
He received his Ph.D. and M.S. in Electrical Engineering from Stanford
University in 2000 and 1997 respectively, and a B.S. degree in
Computer Engineering from the Technion/Israel in 1994. His research
interests span computer architecture, computer arithmetic, VLSI
microarchitecture, CAD and reconfigurable (custom) computing. More
specifically, he is interested in improving the programmability of
FPGAs and making the compute power of reconfigurable computers
accessible to the computer science community.
____________
PSYCHOLOGY TALK
on Wednesday, 28 April 2004, 4:30pm
Jordan Hall 420:245
"The basal ganglia and processing beat-based rhythm in
musicians and non-musicians"
Jessica Grahn
MRC Cognition and Brain Sciences Unit, UK
Regular rhythmic patterns engage us. We tap our toes and bob our heads
to the beat, clap our hands or drum on the steering wheel. This
behaviour has led to the suggestion that we may use a beat-based
timing mechanism for these types of regular temporal patterns, and an
interval-based timer when temporal patterns have no regular beat. In
beat-based timing, neural machinery synchronizes to a perceived beat,
and durations to be timed are encoded and reproduced in reference to
this beat. The interval timer encodes individual durations, recalling
them at any time. Beat-based timing may offer an accuracy benefit or
working memory load reduction, though behavioural evidence for this
has been conflicting. This talk presents both behavioural and neural
evidence for the existence of beat-based timing. The results indicate
that beat-based timing does exist, and is mediated by the basal
ganglia in both musicians and non-musicians. In addition, musicians
show higher levels of activation in brain areas not specifically
involved in beat-based timing.
About the Speaker: Jessica Grahn is a PhD student working with Matthew
Brett and Robert Carlyon at the MRC Cognitive and Brain Sciences Unit.
____________
RNI/STANFORD SEMINAR SERIES IN THEORETICAL NEUROSCIENCE
on Thursday, 29 April 2004, 12 noon - 1pm
BioX/Clark Center, Room S360
http://www.rni.org/seminar2.html
"Inferential Processes and the Architecture of Visual Cortex"
Malcolm Young
University of Newcastle
http://www.ncl.ac.uk/biol/staff/profile/m.p.young
A striking aspect of vision is that observers do not see what their
eyes tell their brains (e.g. Young 1994). This mismatch is clearly
evident in the fact that apparently detailed coloured texture is
seen in extra-foveal vision, where such information is very modest
or absent in the input from the retina, and in other cases, where
information at one location and time influences the likelihood that
information will be present at a different location, and at a later
time. These considerations, and detailed observations of what
neuroanatomy can tell us of the processing architecture in visual
cortex, suggest that the visual system may undertake an inferential
style of computation, rather than analysing bottom-up information
from the eyes, which latter model is in the ascendant in visual
neurophysiology.
We have been exploring the different predictions that the inference
and traditional models make, when the prior probabilities of stimuli
are manipulated, as they are during normal vision. In one paradigm,
we use spatiotemporal sequences of oriented bars to provide the visual
system with "reason to believe" that a bar of particular orientation
will be presented at a particular time. We then vary the congruence,
incongruence and presence of the bar in the RF in relation to these
prior expectations. The results so far are striking: V1 neurons are
often more modulated by these prior and distant events - in many cases
more than 400ms, and 6-10 degrees away from their classical RF - than
they are by events within their RF. Similar results relate to an
experimental paradigm that tests whether V1 neurons really suffer from
the "aperture problem." The interaction between priors and likelihood
functions is well fitted by a Bayesian model, and only poorly fitted
by the traditional view that V1 cells are local filters of one kind or
another. Similarly, Bayesian models fit better than mismatch models,
such as that of Rao and Ballard. These results suggest that V1
neurons are signaling the posterior probability of visual stimuli,
on the basis both of prior knowledge about the world and information
from the eye, effectively imputing feature constellations to the
visual world, rather than simply analysing local contours.
About the Speaker: Professor Malcolm Young is a Provost of Science,
Agriculture and Engineering at the University of Newcastle in England.
His research addresses the question of how the brain works by
investigating how the brain is organized anatomically; what processes
go on inside it; how these processes interact to cause behaviour,
particularly visual behaviour; and what biological constraints must
be met by the brain. His laboratory was instrumental in developing
the neuroinformatics approach - computer-based collation, management,
and analysis of neuroscience data.
____________
SRI AI SEMINAR SERIES
on Thursday, 29 April 2004, 4:00pm - 5:30pm
EJ228, SRI International
http://www.ai.sri.com/seminars/
"Towards an Interoperability Framework for Collaborative Tools"
Eugene Eric Kim
Blue Oxen Associates
Almost 40 years ago, Doug Engelbart demonstrated the first
collaborative software tools at a conference in San Francisco. The
demonstration stunned the audience, who gave Engelbart a standing
ovation. In many ways, today's collaborative tools aren't
significantly better than the software demonstrated that day. One of
the biggest problems is that these tools do not interoperate well with
each other. Kim will discuss the need for a shared conceptual
framework for collaborative tools, which will help reveal new
possibilities for interoperability and improvement.
About the Speaker: Eugene is Executive Director and cofounder of Blue
Oxen Associates, a socially-conscious think tank devoted to improving
collaboration and knowledge management. Previously, he worked as an
independent consultant, specializing in software development, project
management, and strategic planning, and before that, he was Senior
Technical Editor at Dr. Dobb's Journal. He worked closely with Doug
Engelbart on the Open Hyperdocument System, and served on the Core
Planning Committee of Engelbart's Bootstrap Alliance. Eugene has
written for a variety of publications, and is currently writing a book
on the history of free software. Eugene received his A.B. in History
and Science from Harvard University.
____________
UC BERKELEY CIS SEMINAR
on Thursday, 29 April 2004, 4:00pm-5:00pm
Soda Hall 310 (UC Berkeley)
http://www.eecs.berkeley.edu/~ywteh/cis-seminar
"G Source Separation"
Hagai Attias
Some graphical model based algorithms for source separation have two
separate versions, one for clean data (zero noise mixing) and the
other for noisy data (mixing with additive noise). One such algorithm
is independent factor analysis (IFA). One might expect noisy IFA to
reduce to clean IFA as the noise level approaches zero. However, noisy
IFA reduces to PCA in that limit, and performs poorly on low noise
data where clean IFA performs well. I will discuss a simple method,
called the G-trick, for deriving a new noisy IFA algorithm that
performs optimally for any noise level, and reduces to clean IFA in
the zero noise limit. I will also discuss extensions to reverberant
data (convolutive mixing) and to incorporating rich source models.
____________
SSP10: SYMBOLIC SYSTEMS FORUM
on Thursday, 29 April 2004, 4:15pm
Bldg. 380:380C (Math Corner)
http://symsys.stanford.edu:8081/ssp-dynamic/servlet/ssp_events
"How Real Is the Future?"
John Perry
Philosophy, Stanford University
John McTaggart Ellis McTaggart argued that time is not real. He thought
that the concept of time requires that events have, successively, the
properties of being future, being present, and being past; this is the
source of temporal change, the essence of time. These are A-properties.
The 2000 election was once in the future, then it was present, and
now it is in the past. But our concept of time also requires that all
temporal properties derive from the order of events in time; the 2000
election was after the 1996 election and before the 2004 election.
But these order properties, the B-properties, never change, and so
cannot be the source of the changing properties A-properties. So the
whole thing is a unreconcilable mess, and the concept of time makes no
sense, so there is no time.
Contemporary philosophers of time tend to agree that time is real, but
don't agree on what to do about the properties of being future,
present, and past. Some argue that these are indexical or
token-reflexive properties. When I say now "The 2004 election is in
the future," the condition of truth on my remark is that the 2004
election is later than my remark. The relation between the 2004 and
my remark never change. So there is no problem finding the fact that
makes my statement true in the B-series. Others argue that this
analysis leaves the nature of time utterly mysterious, since temporal
change, the essence of time, is left out. We find the A-series within
the B-series at the cost of losing the essence of time. Moreover, it
seems that on this analysis there would be no past, present, or future
without utterances, or at least thoughts, for events to be before,
simultaneous with, or later than.
I'll argue that the indexical analysis of the properties of being
present, past, and future is not right. There are no future events,
for events are not real, do not exist, until they happen. If there is
no event later than an event E, E is present. If there are events
later than it, E is past. Events go from being present to being past
in virtue of new events coming into existence. The B-series facts
change just as the A-series facts do, and are the basis for them.
Time is real, temporal change is unique, and it does not require
utterances or thoughts. Confusion on these matters is due, among
other things, to not distinguishing representations, models, and
reality, and to the overly facile way we talk about propositions being
true at times.
This talk covers work I am doing with Thomas Hofweber.
About the Speaker: John Perry is Henry Waldgrave Stuart Professor of
Philosophy at Stanford University.
____________
INFORMATION SYSTEMS SEMINAR
on Thursday, 29 April 2004, 4:15pm-5:15pm
Packard 101
http://isl.stanford.edu/groups/seminar/
"Dynamic Power Control in a Wireless Channel
Subject to a Quality of Service Constraint"
Baris Ata
Kellogg School of Management
A controller dynamically chooses a state-dependent transmission rate
on a static, point-to-point wireless link by varying transmission
power over time. The transmitter is modeled as a finite-buffer
Markovian queue with adjustable service rates. That is, data packets
arrive to the system according to a Poisson process and packet size is
exponentially distributed. The controller chooses a transmission rate
from a fixed set $A$ of available values, depending on the backlog in
the system. The objective is to minimize long-run average energy
consumption subject to a quality of service constraint, which is
expressed as an upper bound on the packet drop rate. An explicit
formula is developed for the optimal transmission rate as a function
of the packet queue length. In the second half of the talk I will
extend the model and the results to a fading channel.
____________
CS545: DATABASE SEMINAR
on Friday, 30 April 2004, 11:00am - 12 noon
Gates 104
http://www-db.stanford.edu/dbseminar/
(special seminar, note unusual time and place)
"Active XML: A Data-Centric Perspective on Web Services"
Omar Benjelloun
INRIA-Futurs
This talk introduces Active XML (AXML, for short), a declarative
framework that harnesses Web services for data integration, and is put
to work in a peer-to-peer architecture.
An AXML document is an XML document that may contain embedded calls to
Web services. An AXML "peer" is a repository of AXML documents. On
the one hand, it acts as a client, by invoking the calls to Web
services embedded in its documents. On the other hand, a peer acts as
a server, by providing Web services that can be declaratively
specified as queries over the AXML documents it contains. The AXML
approach allows to gracefully combine stored information with data
defined in an intensional manner (as service calls). The fact that
peers can exchange a mixture of materialized and intensional data (via
AXML documents) leads to a very powerful distributed data management
paradigm.
The AXML approach induces a number of technically challenging
problems, both theoretical and practical, which I will describe in the
talk. I will also present a prototype that we developed, and mention
some interesting applications.
____________
NLP READING GROUP
on Friday, 30 April 2004, 2:00pm - 3:30pm
Ventura 17
http://www-nlp.stanford.edu/nlp/nlpgroup.html
"Speed and Accuracy in Shallow and Deep Stochastic Parsing"
Ron Kaplan, Stefan Riezler, Tracy Holloway King, John T. Maxwell III,
Alexander Vassermann, and Richard Crouch (PARC)
http://www.parc.com/istl/members/riezler/PAPERS/NAACL04.pdf
This paper reports some experiments that compare the accuracy and
performance of two stochastic parsing systems. The currently popular
Collin s parser is a shallow parser whose output contains more
detailed semantically-relevant information than other such parsers.
The XLE parser is a deep-parsing system that couples a Lexical
Functional Grammar to a log-linear disambiguation component and
provides much richer representations. We measured the accuracy of both
systems against a gold standard of the PARC 700 dependency bank, and
also measured their processing times. We found the deep-parsing system
to be more accurate than the Collins parser with only a slight
reduction in parsing speed.
____________
LINGUISTICS DEPARTMENT COLLOQUIUM
on Friday, 30 April 2004, 3:30pm
Margaret Jacks Hall, Room 460:126
http://www-linguistics.stanford.edu/colloq/
"Narrativity and Differentiation:
Reading "between the lines" in the Cold War"
Susan Gal
University of Chicago
We know that linguistic differentiation (style) signals the
differentiation of social identities. This insight has been developed
very effectively with reference to non-referential indexes (i.e.
phonological and sometimes syntactic variants). Personal narratives,
by contrast, have usually been analyzed as a means for creating
identity. In the study of narrative strategies, there has been little
attention to processes of social differentiation. My aim is to
examine a Cold War text -- the novelistic representation of a
bureaucratic encounter in Budapest -- in order to show how we can
watch social differentiation emerge through the poetic organization of
conversational interaction in narrative. The study of this text
proves ethnographically revealing too, showing the forms of coercion,
collusion and collaboration that were the hallmarks of state-citizen
relations in the communist period in eastern Europe.
____________
CS528: BROAD AREA COLLOQUIUM FOR
AI-GEOMETRY-GRAPHICS-VISION-ROBOTICS
on Monday, 3 May 2004, 4:15pm
TCSeq 200
http://graphics.stanford.edu/courses/cs528/
"Differential topology and combinatorial algorithms,
and where this unlikely marriage works"
Herbert Edelsbrunner
Arts and Sciences Professor of Computer Science and Mathematics, Duke
In this talk, I will present some of our recent work in computational
topology. I will introduce concepts from differential topology (Morse
functions, Reeb graphs, Jacobi sets, ...) and algebraic topology
(Betti numbers, persistence, ...) and discuss how they can be computed
for piecewise linear data. There are numerous applications of these
ideas and I will focus primarily on problems in structural biology.
About the Speaker: Herbert Edelsbrunner received his Ph.D. in
Technical Mathematics in 1982 from the University of Technology in
Graz, Austria. He was faculty at the University of Illinois at
Urbana-Champaign from 1985 until he joined Duke University in
1999. His primary research interests are in algorithms, geometry and
topology, and structural biology. He published two textbooks in
computational geometry. In 1996, he co-founded Raindrop Geomagic, a
software company that specializes in geometric modeling and shape
reconstruction from scan data.
____________
CS548: INTERNET AND DISTRIBUTED SYSTEMS SEMINAR
on Tuesday, 4 May 2004, 2:45pm - 4:00pm
Skilling Auditorium
http://cs548.stanford.edu/schedule.shtml
"High Availability Techniques and Case Studies from USBank"
Greg Messer
Manager, Enterprise Performance Engineering, US Bancorp
I will describe the practice of "Systems Performance Engineering and
Management" in business and real-time settings. The reason for having
a "Performance Engineering and Management" discipline in industry is
to ensure that an application's performance requirements are met prior
to being released to production, and continue to do so while in
production. I will discuss the science behind the practice, how it is
implemented within its methods, software and roles, and the skill set
necessary to do it. In addition, two case studies will be presented:
the first from a business application point of view, the second from a
real-time service point of view.
____________
SNRC INDUSTRY SEMINAR
on Tuesday, 4 May 2004, 4:15pm
Gates B03
http://snrc.stanford.edu/events/industry-seminar/
"Mars Rover Information Portal"
Ronald Mak
Computer Scientist - NASA Ames Research Center
http://www.arc.nasa.gov/
The Collaborative Information Portal is mission-critical software
developed at NASA Ames for the current Mars Exploration Rover
mission. CIP is used inside mission control at the Jet Propulsion
Laboratory and by NASA scientists and researchers throughout the
world. It displays current staff and event schedules and clocks that
show times in various Mars and Earth time zones. CIP users can
securely download and view data and image files from the mission file
servers, and they can send and receive broadcast messages. CIP plays
a major role in ensuring that mission personnel work together
effectively and efficiently, especially since the mission runs on Mars
time.
CIP is a three-tier enterprise application based on industry-standard
Java technologies (Swing, J2EE, etc.) and web services. This talk
will present an overview of the Mars mission and discuss CIP's
architecture and key components. We'll take a closer look at the CIP
middleware, which handles the remote service calls from hundreds of
client applications.
About the Speaker: Ronald Mak is a computer scientist employed by the
University of California at Santa Cruz. He works in the University
Affiliated Research Center (UARC) at the NASA Ames Research Center.
Ron was the architect and lead for the CIP middleware, and currently,
he does research and development at Ames and mission support at
JPL. Prior to NASA, he spent over 15 years developing enterprise
software in industry. Ron has written books on numerical computing and
on compiler writing, and he has a B.S. in the Mathematical Sciences
and a M.S. in Computer Science from Stanford University.
____________
EE380: COMPUTER SYSTEMS LAB COLLOQUIUM
on Wednesday, 5 May 2004, 4:15pm to 5:30pm
NEC Auditorium (B03), Gates Computer Science Building
http://ee380.stanford.edu/contents.html
"Scalable Visual Comparison of Biological Trees and Sequences"
Tamara Munzner
University of British Columbia
We present several visualization systems for comparing and browsing
evolutionary trees and genomic sequences. These systems use the
Focus+Context navigational metaphor of allowing users to fluidly
stretch and shrink parts of the view, as if manipulating a rubber
sheet with the borders tacked down. We introduce cognitive scalability
to this approach by guaranteeing the visibility of landmarks at all
times, so that users can stay oriented as they explore complex
datasets. In our systems, landmarks can be regions of difference
between datasets, or the results of a search, or user-chosen regions.
This technique, which we call "accordion drawing", supports smooth
realtime transitions between a big-picture overview and a drilled-down
views that show details in context.
TreeJuxtaposer is a system that allows users to compare large trees of
several hundred thousand nodes. We propose a new methodology for
detailed structural comparison between two trees and provide a new
nearly-linear algorithm for computing the best corresponding node from
one tree to another. SequenceJuxtaposer is a sequence visualization
tool for the exploration and comparison of biomolecular sequences.
SequenceJuxtaposer supports interaction at 20 frames per second when
browsing a single sequence of over 1.2 million base pairs, or large
collections of sequences up to 2 million total base pairs. In both
systems, all runtime rendering algorithms are sublinear in the total
number of tree nodes or base pairs in the dataset, and all
preprocessing is subquadratic in those variables. Our rendering
algorithms accomodate high-resolution displays as well as standard
screen sizes. With these systems, we are able to quickly observe many
features that had previously required significant analysis to
discover.
We also present the TJC and TJC-Q systems, which allow users to browse
extremely large trees using the accordion drawing technique. TJC is a
system that supports browsing trees up to 15 million nodes by
exploiting leading-edge graphics hardware while TJC-Q allows browsing
trees up to 5 million node on commodity platforms. Both of these
systems use a fast new algorithm for drawing and culling and benefit
from a complete redesign of all data structures for more efficient
memory usage and reduced preprocessing time. In addition to browsing
evolutionary trees, these tools are also useful in many other
application domains where there is a demand for large-scale tree
browsing, including network management, software engineering, database
integration, and genealogy.
About the speaker: Tamara Munzner has been an assistant professor in
the University of British Columbia Department of Computer Science
since 2002. Her current research interests are information
visualization, graph drawing, dimensionality reduction, and
interactive computer graphics. She was a research scientist from 2000
to 2002 at the Compaq Systems Research Center in California, and
earned her PhD from Stanford between 1995 and 2000. She was on the
technical staff of The Geometry Center, a mathematical visualization
research group at the University of Minnesota, from 1991 to 1995.
____________
SSP10: SYMBOLIC SYSTEMS FORUM
on Thursday, 6 May 2004, 4:15pm
Bldg. 380:380C (Math Corner)
http://symsys.stanford.edu:8081/ssp-dynamic/servlet/ssp_events
"Engineering Deep Grammars"
Tracy King
Researcher, Palo Alto Research Center
Consulting Associate Professor, Symbolic Systems Program
Parsing systems based on deep grammars mark explicitly a variety of
syntactic and semantic dependencies and should therefore provide
crucial support for meaning-sensitive NLP applications. However,
common wisdom has it that parsing systems based on deep linguistic
grammars are too difficult to produce, lack coverage and robustness,
are massively ambiguous, and have poor run-time performance. In this
talk, I will discuss ways in which deep grammars can be engineered to
overcome these problems, focusing on techniques used in the
implementation of the broad-coverage LFG-based grammars produced by
the ParGram project.
About the Speaker: Tracy Holloway King is a member of the research
staff at the Palo Alto Research Center and a consulting associate
professor with the Symbolic Systems Program.
____________
INFORMATION SYSTEMS SEMINAR
on Thursday, 6 May 2004, 4:15pm-5:15pm
Packard 101
http://isl.stanford.edu/groups/seminar/
"Time Synchronization and Distributed Modulation
in Large Sensor Networks
(or 'How to Make a Large Random Field by
Stitching Together Very Many Small Ones')"
Sergio D. Servetto
Cornell University
Joint work with An-swol Hu
We study limit properties of random processes obtained as a properly
scaled sum of a large number of randomly shifted pulses. Such
processes arise naturally in a sensor networking application, where a
large number of power-constrained radio transmitters coordinate their
access to a Gaussian multiple access channel, to cooperate in
generating a strong information-bearing waveform. In this talk we will
see how, under very mild technical assumptions easily met in practice,
the resulting limit waveform has some useful deterministic properties:
it is continuous, with known roots and polarity. We will also see how
these properties lead to efficient and practical implementations of
two essential primitive sensor networking functions: global time
synchronization, and reachback communication. Time permitting, we will
discuss connections with some old problems in network information
theory (and some new ones too), and we will overview experimental work
on these topics currently under way.
____________
STANFORD PHONOLOGY WORKSHOP
on Thursday, 6 May 2004, 5:30pm
Margaret Jacks Hall 460:126
http://www-linguistics.stanford.edu/pinterest/
"A French Dialect Continuum Across North America"
Luc Baronian
Stanford University
Besides presenting interesting grammatical phenomena, the study of the
French dialects and creoles of the Americas involves complex contact
situations between Africans, (Native) Americans and Europeans, and poses a
significant challenge for language change and classification. Therefore
sociolinguists, historical linguists and even theoretical linguists have
much to learn from their study.
Based on striking resemblances, Hull (1968) first proposed that the French
dialects and creoles of the colonies were all descendent from a maritime
jargon used in the ports of Western France. While linguists who have
concentrated on the genesis of Quebec French (Barbaud, Asselin &
McLaughlin, Morin, Poirier) have all rejected this strong hypothesis at
least for Quebec, a satisfactory global account of the genesis of these
languages and their relation to one another as well as to 17th century
dialects of French is still lacking.
I propose a traditional answer to part of this larger question, based
on recent fieldwork in Louisiana and research at the Library of
Congress on old recordings of former Missouri and Indiana varieties of
French. I show that a former dialect continuum emerges from Canada to
Louisiana, along the St Lawrence and Mississippi rivers.
The results are similar to Andrew Garrett's account of the origin of
Greek, where he proposes that closely related Indo-European dialects
gradually moved closer together by undergoing the same phonological
changes. In the case of the North American French dialects, the
evidence suggests that distinct dialects underwent regional changes
that gradually brought them closer to neighboring dialects, while
distancing them from dialects from which they were originally closer.
____________
CS547: HUMAN-COMPUTER INTERACTION SEMINAR
on Friday, 7 May 2004, 12:30-2:00pm
Gates B01
http://www-pcd.stanford.edu/cs547/
"Understanding the Requirements for
Developing and Designing Free/Open Source Software"
Walt Scacchi
Institute for Software Research, UC Irvine
In this talk, I present findings from an ongoing empirical study of
social processes, technical system configurations, organizational
contexts, and interrelationships that give rise to free/open source
software. The focus is directed at understanding what the requirements
for designing F/OSS may be. Multiple kinds of software development
informalisms are found to play a critical role in the elicitation,
analysis, specification, validation, and management of the
requirements and design of F/OSS systems, artifacts, processes,
project portals, and communities. Subsequently, understanding the
roles these informalisms take in the development process for F/OSS is
the focus of this presentation.
Research publications and related information at:
http://www.isr.uci.edu/research-open-source.html
About the Speaker: Walt Scacchi is senior research computer scientist
and research faculty member at the Institute for Software Research,
and director of research for the Laboratory for Game Culture and
Technology, at the University of California Irvine. He received his
Ph.D. in Information and Computer Science at UC Irvine in 1981. He
joined ISR in 1999 after serving on the faculty at the University of
Southern California for 18 years. From 1981 to 1991 he founded and
directed the USC System Factory, and from 1993 to 1998 he directed the
USC ATRIUM Laboratory. His interests include open source software
development, software process engineering, software acquisition and
electronic commerce, and organizational studies of system
development. He is an active researcher with more than 100 research
papers, and consults widely to clients in industry and government
agencies.
____________
NLP READING GROUP
on Friday, 7 May 2004, 2:00pm - 3:30pm
Ventura 17
http://www-nlp.stanford.edu/nlp/nlpgroup.html
"Unsupervised multilingual sentence boundary disambiguation"
Jan Strunk
Linguistics, Stanford
The detection of sentence boundaries constitutes a preliminary, yet
important step in electronic text processing. A majority of linguistic
analyses and methods in computational linguistics posit the sentence
as a crucial unit. What is more, sentence boundary detection itself
can be seen as a typical instance of one of the major problems in
computational linguistics: ambiguity resolution. In many languages of
the world, the marker for the sentence boundary, the period is not
used unequivocally. In particular, it is also used to mark
abbreviations. To make this ambiguity even more problematic the period
can as well be used to mark an abbreviation at the end of a sentence.
Although being a comparatively minor task in language processing,
sentence boundary detection has received some attention in the past
decade, as can be witnessed by approaches such as Palmer/Hearst
(1997), Grefenstette (1999), Schmid (2000) and Mikheev (2002). It is a
common property of these approaches that the developed methods have
been applied to a small range of languages only, usually covering one
to three languages. In such a setting, it remains unclear how well the
suggested methods operate if more languages or different types of
corpora are considered.
I will present an approach to sentence boundary detection that I
developed together with Tibor Kiss at the University of Bochum which
is supposed to be language and corpus independent and determines
sentence boundaries with high accuracy. We assume that the
determination of abbreviations is an easier task than the detection of
sentence boundaries, and base our approach on the detection of
abbreviations as a first step. This approach does not make use of
additional annotations, POS tagging, or precompiled lists to support
sentence boundary detection. Also, it does not use orthographical
information as primary evidence and is thus also suited to process
single-case text. It focuses on robustness and flexibility in that it
can be applied to a variety of languages without any adjustment. The
basic algorithm has been determined experimentally on the basis of
large unannotated corpora from German and English. We have applied the
results to further corpora from German and English as well as to
corpora from nine other languages (Brazilian Portuguese, Dutch,
Estonian, French, Italian, Norwegian, Spanish, Swedish, and Turkish).
In the evaluations reported, the mean accuracy of sentence boundary
detection is 98.47 %, the mean accuracy for abbreviation detection is
99.05%.
I will give an introduction into our general approach to abbreviation
detection and the additional heuristics we use to detect abbreviations
at the end of sentences and some more special cases. I will also
report on the results we have obtained when we tested our algorithm on
eleven different languages.
Grefenstette, Gregory. (1999). Tokenization.. In van Halteren, H.
(Ed.) Syntactic Wordclass Tagging. Dordrecht: Kluwer Academic
Publishers, pp. 117-133.
Mikheev, Andrei (2002). Periods, Capitalized Words, etc. Computation
Linguistics 28(3), pages 289-318.
Palmer, David D. and Matti A. Hearst (1997). Adaptive multilingual
sentence boundary disambiguation. Computational Linguistics 23(3),
pages 241-267.
Schmid, Helmut (2000). Unsupervised Learning of Period Disambiguation
for Tokenisation. Internal Report, IMS, University of Stuttgart.
http://www.stanford.edu/~jstrunk/pubeng.html
____________
END MATERIAL
The CSLI Calendar appears weekly on Wednesdays throughout the academic
year. Announcements, abstracts, and other information to appear in
the Calendar should be submitted to the editor, who reserves the right
to decide what does or does not go in the calendar
mailto:incalendar@csli.stanford.edu
Requests to be added to the mailing list should be sent to
majordomo@csli.stanford.edu. With the lines in the body of the text
of either
subscribe csli-calendar
for the long form or
subscribe csli-short-calendar
for the short form (i.e., no abstracts). Problems with subscribing or
unsubscribing should be sent to
owner-csli-calendar@csli.stanford.edu.
The full current issue is at
http://www-csli.stanford.edu/Archive/calendar/current.shtml
and the archives at
http://www-csli.stanford.edu/Archive/calendar/
People on most of the CSLI computers can type 'help csli-calendar' to
see the current issue.
The CSLI Calendar is also posted each week to
news://nntp-csli.stanford.edu/csli.bboard.
and
news://news.stanford.edu/su.events
Information about CSLI's research program is available at
http://www-csli.stanford.edu/
For maps to the Stanford University campus see
http://www.stanford.edu/home/visitors/maps.html
____________