CURRENT PROJECTS

Project Name A Patient Wellness Tracking System: Dealing with Database Usability Issues in Electronic Medical Record Systems
Description We are developing, designing, and evaluating a patient wellness tracking system for the 11th Street Family Health Services Center of Drexel University, an ambulatory setting located in the center of  Philadelphia. We first evaluate the effectiveness of the  goal-oriented analysis for designing patient-centered Health IT systems for chronic diseases management. We then investigate a number of issues related to database design and usability. The primary objective is to develop a flexible system that allows healthcare professionals to rapidly gather and analyze healthcare data using structured databases.
Keywords System Design, Health IT, Goal Modeling, Requirements Engineering, Data Integration, Database Design, Database Usability
Sponsors The 11th Street Family Health Services Center of Drexel University, iSchool at Drexel.
Members Yuan An, Ritu Khare, Patricia Gerrity, Michelle Rogers, Prudence Dalrymple
Publications
  1. Yuan An, Prudence Dalrymple, Michelle Rogers, Patricia Gerrity, Jennifer Horkoff, Eric Yu. Collaborative Social Modeling for Designing a Patient Wellness Tracking System in a Nurse-Managed Healthcare Center. In the 4th International Conference on Design Science Research in Information Systems and Technology (DESRIST'09). Philadelphia, PA, USA. 2009.
  2. (pdf)
Project Website  


Project Name KROSS: Knowledge Repository of Schemas and Semantics
Description
Despite the ubiquitous use of the Internet for sharing and locating information, it is difficult on the current Web to find schemas by given semantics and requirements. However, with the rapidly increasing demand on information integration across distributed sources, it is desirable and very important to (1) search and reuse (parts of) schemas that describe data in data sources, (2) design schemas with good properties to facilitate data integration in unforeseen situations, and (3) maintain and share semantic mappings between heterogeneous schemas. The purpose of the KROSS project is to investigate novel corpus-based approaches and develop effective and efficient tools for tracing and sharing schemas and semantics to address the above challenges. A schema refers to a data representation which describes elements and relationship in a particular domain, e.g., relational schema. Semantics of a schema amounts to the correspondence between the schema and the subject matter it describes. A great deal of effort has been put into the problem of discovering semantic mappings between schemas. It is surprisingly rare, however, to consider the upfront and intuitively more effective effort on sharing and reusing schemas as well as their semantics in the process of schema design and mapping creation.


The KROSS (Knowledge Repository Of Schemas and Semantics) repository contains classified and indexed schemas and mappings. We develop a set of effective and efficient tools for utilizing the repository for schema management and integration. Specifically, we employ techniques in database (schema integration and mapping) and artificial intelligence (machine learning and ontology) to attack the following central challenges: (1) Dynamically creating and maintaining the KROSS schema and mapping repository. (2) Searching schemas with respect to given semantics using keywords as well as structured queries. (3) Developing a design-by-example schema design approach which generates a new schema by searching and combining existing schemas. (4) Discovering schema semantics using corpora of schemas.
 

The KROSS project brings many transformational innovations to the study of schema design, schema mapping, and data integration. It draws upon the strengths of existing technologies and adds many novel approaches and significant contributions. The KROSS repository contains a number of classified and archived representations of each concept in a specific domain. It supports the development of integration-aware and semantics-enriched schemas through a transformational design-by-example schema design approach. Moreover, aggregated symbolic and probabilistic evidence provides many opportunities to increase the automation of schema mapping.

Keywords Web Search, Meta Search, Web Information Systems, Schema Mapping, Conceptual Modeling
Sponsors iSchool at Drexel
Members Yuan An, Il-Yeol Song
Publications
  1. Y. An and I. Song. Discovering Semantically Similar Associations (SeSA) for Complex Mapping between Conceptual Models. In the Proceedings of the 27th International Conference on Conceptual Modeling (ER'08). 20-23 October 2008, Barcelona, Spain.


  2. Y. An, X. (Tony) Hu, and I. Song. Round-Trip Engineering for Maintaining Conceptual-Relational Mappings. In the Proceedings of  20th International Conference on Advanced Information Systems Engineering (CAiSE'08). 16-20 June 2008, Montpellier, France.
Project Website http://cluster.ischool.drexel.edu:8080/kross


Project Name iBioSearch (Integrated Biological Database Search)
Description The integrated biological databases search (iBioSearch) system is aimed at providing a unifying interface and advanced capability for searching various online biological databases. The iBioSearch system is an online search system which dynamically builds a biological database search ontology as the user interface. The ontology acting as a global schema connects to more than 1000 online biological databases. User's queries against the ontology are translated into queries against underlying online query interfaces. Query results returned from the underlying databases are consolidated, reconciled, and ranked by data cleansing and relevance computing algorithms. Salient features of iBioSearch system includes dynamically updating biological databases repository and maintaining semantic mappings when base databases evolve.
Keywords Data Integration, Bioinformatics, Web Search, Meta Search, Web Information Systems, Schema Mapping, Ontology-Based Information Integration, Reverse Engineering, Conceptual Modeling
Sponsors iSchool at Drexel
Members Yuan An, Ritu Khare
Publications
  1. Ritu Khare and Yuan An. An Empirical Study on Using Hidden Markov Model for Search Interface Segmentation. To appear in the Proceedings of 18th ACM Conference on Information and Knowledge Management (CIKM'09). HongKong, China. 2009.

  2. Xin Chen, Caimei Lu, and Yuan An. Probabilistic Models for Topic Learning from Images and Captions in Online Biomedical Literatures. To appear in the Proceedings of 18th ACM Conference on Information and Knowledge Management (CIKM'09). HongKong, China. 2009.

Project Website  http://ibiosearch.sourceforge.net


PAST PROJECTS


Project Name MAPONTO
Description MAPONTO is a semantic mapping discovery tool. It was implemented as a third-party plugin of the Protege knowledge management platform. MAPONTO is available for downloading and testing.
Keywords Ontology, Semantic Mapping, Database Design, Data Semantics
Sponsors  KM Lab, Department of Computer Science, University of Toronto
Members  Yuan An, Alex Borgida, John Mylopoulos
Project Website  http://www.cs.toronto.edu/semanticweb/maponto