General
Home News About


Contributors
Peter P. Chen Award 2009 Camera-ready submission Call for Papers Call for Workshops Call for PhD Colloquium Call for Posters and Demos Call for Panels Call for Tutorials Manuscript PreparationConference
Conference Program Keynote Speakers Accepted Papers Workshops Posters and Demonstrations Panels Tutorials Conference Committee Program CommitteeParticipants
Registration Venue Travel Information &Invitation Letter Accommodation Attractions Maps Transfers & Optional
Activities Weather

Miscellaneous
Publicity Materials Contact SponsorsKeynote Speakers
Antonio L. Furtado

A Frame Manipulation Algebra for ER Logical Stage Modelling
Antonio L. Furtado, Marco A. Casanova, Karin K. Breitman, Simone D. J. Barbosa
Departamento de Informática. Pontifícia Universidade Católica do Rio de Janeiro
{furtado, casanova, karin, simone}@inf.puc-rio.br



Abstract:
The ER model is arguably today's most widely accepted basis for the conceptual specification of information systems. A further common practice is to use the Relational Model at an intermediate logical stage, in order to adequately prepare for physical implementation. Although the Relational Model still works well in contexts relying on standard databases, it imposes certain restrictions, not inherent in ER specifications, that make it less suitable in Web environments. The paper proposes frames as an alternative to extend the ER approach to logical stage modelling, and treats frames as an abstract data type equipped with a Frame Manipulation Algebra (FMA). It is argued that frames, with a long tradition in AI applications, are able to accommodate the irregularities of semi-structured data, and that frame-sets generalize relational tables, allowing to drop the strict homogeneity requirement. A prototype logic-programming tool has been developed to experiment with FMA. Examples are included to help describe the use of the operators.
Short bio:Antonio L. Furtado is Professor Emeritus at the Departamento de Informática of the Pontifícia Universidade Católica do Rio de Janeiro. He obtained his Ph.D. degree in Computer Science from the University of Toronto in 1974. His main areas of interest are Conceptual Modelling of Information Systems, Logic Programming, and Digital Storytelling. Together with Clesio Saraiva dos Santos (Universidade Federal do Rio Grande do Sul) and Erich J. Neuhold (Darmstadt University of Technology), he participated in the very first Entity-Relationship conference (ER 1979, Los Angeles, California).
Since then, he has been publishing his work in journals and technical events, including eight additional participations in ER conferences. In co-authorship with Erich J. Neuhold and with the collaboration of his colleagues Marco A. Casanova and Paulo A. S. Veloso, he wrote the book Formal Techniques for Data Base Design, Berlin: Springer-Verlag, 1986. He also does research on Medieval Literature, having recently contributed the chapter "The Crusaders' Grail" to the book The Grail, the Quest and the World of Arthur, organized by Norris J. Lacy (Pennsylvania State University), Cambridge: D. S. Brewer, 2008.
With respect to Conceptual Modelling, he is investigating, together with colleagues and students, a multi-disciplinary approach that encompasses static, dynamic and behavioural aspects of Information Systems. This approach borrows plan-generation and plan-recognition methods from Artificial Intelligence, as well as several notions from Literary Theory, Linguistics and Semiotics, such as narrative plots, rhetorical tropes and analogy. Besides its application to business domains, it has been used to characterize literary genres, as part of a Digital Storytelling project.
John Mylopoulos

Conceptual Modeling in the Time of the Revolution: Part II
John Mylopoulos
University of Trento (Italy)


Abstract:
Conceptual Modeling was a marginal research topic at the very fringes of Computer Science in the 60s and 70s, when the discipline was dominated by topics focusing on programs, systems and hardware architectures. Over the years, however, the field has moved to centre stage and has come to claim a central role both in Computer Science research and practice in diverse areas, such as Software Engineering, Databases, Information Systems, the Semantic Web, Business Process Management, Service-Oriented Computing, Multi-Agent Systems, Knowledge Management, and more. The transformation was greatly aided by the adoption of standards in modeling languages (e.g., UML), and model- based methodologies (e.g., Model-Driven Architectures) by the Object Management Group (OMG) and other standards organizations. We briefly review the history of the field over the past 40 years, focusing on the evolution of key ideas. We then note some open challenges and report on-going research, covering topics such as the representation of variability in conceptual models, capturing model intentions, and models of laws.
Notes: A keynote with a similar title was given 12 years ago at CAiSE'97, hence the "part II". The research presented in the talk was conducted jointly with colleagues at the Universities of Toronto (Canada) and Trento (Italy).
Short bio:John Mylopoulos earned a PhD degree from Princeton University in 1970 and has been professor of Computer Science at the University of Toronto since that year. His research interests include conceptual modeling, requirements engineering, data semantics and knowledge management. Mylopoulos is a fellow of the Association for the Advancement of Artificial Intelligence (AAAI, aka American Association for Artificial Intelligence) and the Royal Society of Canada (Academy of Sciences). He has served as programme/general chair of international conferences in Artificial Intelligence, Databases and Software Engineering, including IJCAI (1991), Requirements Engineering (1997), and VLDB (2004). He is currently serving as series co-editor of the Lecture Notes in Business Information Processing (LNBIP) series published by Springer-Verlag.
Since September 2005 Mylopoulos is distinguished professor (chiara fama) of Science at the University of Trento.
Laura Haas

Schema AND Data: A Holistic Approach to Mapping, Resolution and Fusion in Information Integration
Laura Haas - IBM Almaden Research Center, San Jose, CA 95120, USA
Martin Hentschel - Systems Group, ETH Zurich, Switzerland
Donald Kossmann - Systems Group, ETH Zurich, Switzerland
Renée J. Miller - Department of Computer Science, University of Toronto, Canada

Abstract:
To integrate information, data in different formats, from different, potentially overlapping sources, must be related and transformed to meet the users' needs. Ten years ago, Clio introduced nonprocedural schema mappings to describe the relationship between data in heterogeneous schemas. This enabled powerful tools for mapping discovery and integration code generation, greatly simplifying the integration process. However, further progress is needed. We see an opportunity to raise the level of abstraction further, to encompass both data- and schema-centric integration tasks and to isolate applications from the details of how the integration is accomplished. Holistic information integration supports iteration across the various integration tasks, leveraging information about both schema and data to improve the integrated result. Integration independence allows applications to be independent of how, when, and where information integration takes place, making materialization and the timing of transformations an optimization decision that is transparent to applications. In this talk, we define these two important goals, and propose leveraging data mappings to create a framework that supports both data- and schema-level integration tasks.
Short bio:Dr. Laura Haas is an IBM Fellow and director of computer science at Almaden Research Center.
Most recently, she was responsible for Information Integration Solutions (IIS) architecture in IBM's Software Group, after leading the IIS development team through its first two years. Laura Haas joined the development team in 2001 as manager of DB2 UDB Query Compiler development. Previously, Laura was a research staff member and manager at IBM's Almaden Research Center for nearly twenty years. In Research, she worked on and managed a number of exploratory projects in distributed database systems. She is best known for her work on the Starburst query processor (from which DB2 UDB was developed), on Garlic, a system which allowed federation of heterogeneous data sources, and on Clio, the first semi-automatic tool for heterogeneous schema mapping. Garlic technology married with DB2 UDB query processing is the basis for WebSphere Information Integrator's federation capabilities, while Clio capabilities are a core differentiator for the new Rational Data Architect.
Laura is an active member of the database community, serving as vice chair of ACM SIGMOD from 1989-1997, and, currently, as vice president of the VLDB Board of Trustees, as well as on many program committees for technical conferences. She has received several IBM awards for Outstanding Technical Achievement, and an IBM Corporate Award for her work on federated database technology. She is a member of the IBM Academy of Technology.
Divesh Srivastava

Data Auditor: Analyzing Data Quality using Pattern Tableaux (Industrial Keynote)
Divesh Srivastava
AT&T Labs-Research

Abstract:
Monitoring databases maintain configuration and measurement tables about computer systems, such as networks and computing clusters, and serve important business functions, such as troubleshooting customer problems, analyzing equipment failures, planning system upgrades, etc. These databases are prone to many data quality issues: configuration tables may be incorrect due to data entry errors, while measurement tables may be affected by incorrect, missing, duplicate and delayed polls.
We describe Data Auditor, a tool for analyzing data quality and exploring data semantics of monitoring databases. Given a user-supplied constraint, such as a boolean predicate expected to be satisfied by every tuple, a functional dependency, or an inclusion dependency, Data Auditor computes "pattern tableaux", which are concise summaries of subsets of the data that satisfy or fail the constraint. We discuss the architecture of Data Auditor, including the supported types of constraints and the tableau generation mechanism. We also show the utility of our approach on an operational network monitoring database.
This is joint work with Lukasz Golab, Howard Karloff and Flip Korn.
Short bio:Divesh Srivastava is the head of Database Research at AT&T Labs Research. He received his Ph.D. from the University of Wisconsin, Madison, and his Bachelor of Technology from the Indian Institute of Technology, Bombay, India. His current research interests include data quality, data streams and data privacy.
Paul Nielsen

A personal view of the joys and responsibilities of a career in science (Distinguished Speaker)
Director of the Software Engineering Institute (SEI) of Carnegie-Mellon University
Research is often a tough business and world economic conditions have made it tougher. In the midst of a sometimes discouraging environment, we should remember how fortunate we are to work as scientists and engineers. We explore new realms, work with an international community of talented men and women, and work at jobs that we love. With these benefits come responsibilities. As individuals and as a group, we need to mentor the next generation, increase the role and impact of scientists on important global issues, and do more to enhance the understanding and excitement about science and technology.
Short bio:Dr. Paul D. Nielsen is Director and Chief Executive Officer of the Software Engineering Institute (SEI), a Federally Funded Research and Development Center operated by Carnegie Mellon University. The SEI advances software engineering principles and practices through focused research and development, which is transitioned to the broad software engineering community.
Prior to his arrival as SEI Director, Nielsen served in the U.S. Air Force, retiring as a Major General after 32 years of distinguished service. As commander of the Air Force Research Laboratory at Wright-Patterson Air Force Base in Ohio for more than four years, he managed the Air Force’s science and technology budget of more than $3 billion annually. He also served as the Air Force’s technology executive officer, determining the investment strategy for the full spectrum of Air Force science and technology activities.
In 2004, Nielsen became a Fellow of the American Institute of Aeronautics and Astronautics (AIAA). He served as the AIAA President from 2007-2008 and is a member of the AIAA Board of Directors. In 2006, he was elected as a Fellow of the Institute of Electrical and Electronic Engineers (IEEE). Nielsen serves on the Air Force Scientific Advisory Board and is a member of the Board of Directors for the Hertz Foundation, a non-profit that awards graduate school fellowships in the applied sciences.