Big Data Management and Mining Laboratory

Publications

Sort By:  


  • 2023 - Large scale proteomic studies create novel privacy considerations

      Authors:Andrew C. Hill, Claire Guo, Elizabeth M. Litkowski, Ani W Manichaikul, Bing Yu, Iain R. Konigsberg, Betty A. Gorbet, Leslie A. Lange, Katherine A. Pratte, Katerina J. Kechris, Matthew DeCamp, Marilyn Coors, Victor E. Ortega, Stephen S. Rich, Jerome I. Rotter, Robert E. Gerzsten, Clary B. Clish, Jeffrey L. Curtis, Xiaowei Hu, Ma-en Obeidat, Melody Morris, Joseph Loureiro, Debby Ngo, Wanda K. O'Neal, Deborah A. Meyers, Eugene R. Bleecker, Brian D. Hobbs, Michael H. Cho, Farnoush Banaei-Kashani, Russell P. Bowler

      Publication:Nature

      Category: Journal

      Link: TBA

  • 2023 - Evaluation of Quantitative Decision-Making for Rhythm Management of Atrial Fibrillation Using Tabular Q-Learning

      Authors:Christopher D. Barrett, Yuto Suzuki, Sundos Hussein, Lohit Garg, Alexis Tumolo, Amneet Sandhu, John J. West, Matthew Zipse, Ryan Aleong, Paul Varosy, Wendy S. Tzou, Farnoush Banaei-Kashani and Michael A. Rosenberg

      Publication:JAHA

      Category: Journal

      Link: JAHA

  • 2023 - Deep Learning on Graphs for Multi-Omics Classification of COPD

      Authors:Yonghua Zhuang, Fuyong Xing, Debashis Ghosh, Brian D Hobbs, Craig P Hersh, Farnoush Banaei-Kashani, Russell P. Bowler, Katerina Kechris

      Publication:PLOS ONE

      Category: Journal

      Link: TBA

  • 2023 - Unsupervised Detection of Anomalous Bridge Performance

      Authors: Ryan Cheng, Yail J. Kim, Farnoush Banaei-Kashani

      Publication:Transportation Research Board Annual Meeting

      Category: Conference

      Link: TBA

  • 2023 - Physics-Guided Neural Networks for Bridge Deterioration Forecasting

      Authors: Tobby Lie, Kevin Rens, Farnoush Banaei-Kashani

      Publication:Transportation Research Board Annual Meeting

      Category: Conference

      Link: TBA

  • 2022 - ST-DeepGait: A Spatiotemporal Deep Learning Model for Human Gait Recognition

      Authors: Latisha Konz, Andrew Hill, Farnoush Banaei-Kashani

      Publication:MDPI

      Category: Journal

      Link: Open Access Article

  • 2022 - Effective Subject Representation based on Multi-omics Disease Networks using Graph Embedding

      Authors: Sundous Hussein, Thao Vu, Katerina Kechis, Russel Bowler, Leslie Lange, Farnoush Banaei-Kashani

      Publication:IEEE BIBM

      Category: Conference

      Link: TBA

  • 2022 - Semi-supervised Embedding for Scalable and Accurate Time Series Clustering

      Authors: Andrew Hill, Russell Bowler, Katerina Kechris, Farnoush Banaei-Kashani

      Publication:IEEE Big Data

      Category: Conference

      Link: TBA

  • 2022 - Learn2Pool: Efficient and Effective Ride Assignment in Ride Sharing Systems

      Authors: Ryan Cheng, Selvakumar Jayaraman, Robert Fitzgerald, Farnoush Banaei-Kashani

      Publication: IWCTS

      Category: Conference

      Link: TBA

  • 2022 - Significant Subgraph Detection in Multi-Omics Networks for Disease Pathway Identification

      Authors: Mohamed Abdel-Hafiz, Mesbah Najafi, Shahab Helmi, Katherine A. Pratte, Yonghua Zhuang, Weixuan Liu, Katerina Kechris, Russell Bowler, Leslie Lange, Farnoush Banaei-Kashani

      Publication:Frontiers in Big Data

      Category: Journal

      Link: METHODS article

  • 2022 - iLFQA: A Platform for Efficient and Accurate Long-Form Question Answering

      Authors: Rhys Butler, Vishnu Dutt Duggirala, Farnoush Banaei-Kashani

      Publication:WSDM

      Category: Journal

      Link: TBA

  • 2021 - City-Scale System-Optimal Route Planning with Route Replanning (short paper)

      Authors: Robert Fitzgerald, Farnoush Banaei-Kashani

      Publication:IEEE Big Data

      Category: Conference

      Link: TBA

  • 2021 - Online Route Replanning for Scalable System-Optimal Route Planning (short paper)

      Authors:Robert Fitzgerald, Farnoush Banaei-Kashani

      Publication:SIGSPATIAL

      Category: Conference

      Link: TBA

  • 2021 - Multi-omics subtyping pipeline for chronic obstructive pulmonary disease

      Authors: Lucas A. Gillenwater, Shahab Helmi, Evan Stene, Katherine A. Pratte, Yonghua Zhuang, Ronald P. Schuyler, Leslie Lange, Peter J. Castaldi, Craig P. Hersh, Farnoush Banaei-Kashani, Russell P. Bowler, Katerina J. Kechris

      Publication:PloS one

      Category: Journal

      Link: Plos One

    Abstract: Chronic Obstructive Pulmonary Disease (COPD) is the third leading cause of mortality in the United States; however, COPD has heterogeneous clinical phenotypes. This is the first large scale attempt which uses transcriptomics, proteomics, and metabolomics (multi-omics) to determine whether there are molecularly defined clusters with distinct clinical phenotypes that may underlie the clinical heterogeneity. Subjects included 3,278 subjects from the COPDGene cohort with at least one of the following profiles: whole blood transcriptomes (2,650 subjects); plasma proteomes (1,013 subjects); and plasma metabolomes (1,136 subjects). 489 subjects had all three contemporaneous -omics profiles. Autoencoder embeddings were performed individually for each -omics dataset. Embeddings underwent subspace clustering using MineClus, either individually by -omics or combined, followed by recursive feature selection based on Support Vector Machines. Clusters were tested for associations with clinical variables. Optimal single -omics clustering typically resulted in two clusters. Although there was overlap for individual -omics cluster membership, each -omics cluster tended to be defined by unique molecular pathways. For example, prominent molecular features of the metabolome-based clustering included sphingomyelin, while key molecular features of the transcriptome-based clusters were related to immune and bacterial responses. We also found that when we integrated the -omics data at a later stage, we identified subtypes that varied based on age, severity of disease, in addition to diffusing capacity of the lungs for carbon monoxide, and precent on atrial fibrillation. In contrast, when we integrated the -omics data at an earlier stage by treating all data sets equally, there were no clinical differences between subtypes. Similar to clinical clustering, which has revealed multiple heterogenous clinical phenotypes, we show that transcriptomics, proteomics, and metabolomics tend to define clusters of COPD patients with different clinical characteristics. Thus, integrating these different -omics data sets affords additional insight into the molecular nature of COPD and its heterogeneity.

  • 2021 - Machine Learning Characterization of COPD Subtypes: Insights From the COPDGene Study

      Authors: Peter J. Castaldi, Adel Boueiz, Jeong Yun, Raul San Jose Estepar, James C. Ross, George Washko, Michael H. Cho, Craig P. Hersh, Gregory L. Kinney, Kendra A. Young, Elizabeth A. Regan, David A. Lynch, Gerald J. Criner, Jennifer G. Dy, Stephen I. Rennard, Richard Casaburi, Barry J. Make, James Crapo, Edwin K. Silverman, John E. Hokanson, James D. Crapo, Edwin K. Silverman, Barry J. Make, Elizabeth A. Regan, Terri Beaty, Ferdouse Begum, Peter J. Castaldi, Michael Cho, Dawn L. DeMeo, Adel R. Boueiz, Marilyn G. Foreman, Eitan Halper-Stromberg, Lystra P. Hayden, Craig P. Hersh, Jacqueline Hetmanski, Brian D. Hobbs, John E. Hokanson, Nan Laird, Christoph Lange, Sharon M. Lutz, Merry-Lynn McDonald, Margaret M. Parker, Dmitry Prokopenko, Dandi Qiao, Elizabeth A. Regan, Phuwanat Sakornsakolpat, Edwin K. Silverman, Emily S. Wan, Sungho Won, Juan Pablo Centeno, Jean-Paul Charbonnier, Harvey O. Coxson, Craig J. Galban, MeiLan K. Han, Eric A. Hoffman, Stephen Humphries, Francine L. Jacobson, Philip F. Judy, Ella A. Kazerooni, Alex Kluiber, David A. Lynch, Pietro Nardelli, John D. Newell, Aleena Notary, Andrea Oh, Elizabeth A. Regan, James C. Ross, Raul San Jose Estepar, Joyce Schroeder, Jered Sieren, Berend C. Stoel, Juerg Tschirren, Edwin {Van Beek}, Bram {van Ginneken}, Eva {van Rikxoort}, Gonzalo Vegas Sanchez-Ferrero, Lucas Veitel, George R. Washko, Carla G. Wilson, Robert Jensen, Douglas Everett, Jim Crooks, Katherine Pratte, Matt Strand, Carla G. Wilson, John E. Hokanson, Gregory Kinney, Sharon M. Lutz, Kendra A. Young, Surya P. Bhatt, Jessica Bon, Alejandro A. Diaz, MeiLan K. Han, Barry Make, Susan Murray, Elizabeth Regan, Xavier Soler, Carla G. Wilson, Russell P. Bowler, Katerina Kechris,, Farnoush Banaei-Kashani

      Publication:Chest

      Category: Journal

      Link: ScienceDirect

    Abstract: COPD is a heterogeneous syndrome. Many COPD subtypes have been proposed, but there is not yet consensus on how many COPD subtypes there are and how they should be defined. The COPD Genetic Epidemiology Study (COPDGene), which has generated 10-year longitudinal chest imaging, spirometry, and molecular data, is a rich resource for relating COPD phenotypes to underlying genetic and molecular mechanisms. In this article, we place COPDGene clustering studies in context with other highly cited COPD clustering studies, and summarize the main COPD subtype findings from COPDGene. First, most manifestations of COPD occur along a continuum, which explains why continuous aspects of COPD or disease axes may be more accurate and reproducible than subtypes identified through clustering methods. Second, continuous COPD-related measures can be used to create subgroups through the use of predictive models to define cut-points, and we review COPDGene research on blood eosinophil count thresholds as a specific example. Third, COPD phenotypes identified or prioritized through machine learning methods have led to novel biological discoveries, including novel emphysema genetic risk variants and systemic inflammatory subtypes of COPD. Fourth, trajectory-based COPD subtyping captures differences in the longitudinal evolution of COPD, addressing a major limitation of clustering analyses that are confounded by disease severity. Ongoing longitudinal characterization of subjects in COPDGene will provide useful insights about the relationship between lung imaging parameters, molecular markers, and COPD progression that will enable the identification of subtypes based on underlying disease processes and distinct patterns of disease progression, with the potential to improve the clinical relevance and reproducibility of COPD subtypes.

  • 2021 - Detection of Microsleep Events with a Behind-the-ear Wearable System

      Authors: Biu et al.

      Publication: IEEE Transaction of Mobile Computing

      Category: Journal

      Link: TBA

    Abstract: TBA

  • 2021 - Aptardi predicts polyadenylation sites in sample-specific transcriptomes using high throughput RNA sequencing and DNA sequence

      Authors: Ryan Lusk, Evan Stene, Farnoush Banaei-Kashani, Boris Tabakoff, Katerina Kechris, Laura Saba

      Publication: Nature Communications

      Category: Journal

      Link: TBA

    Abstract: TBA

  • 2021 - iTA: A Digital Teaching Assistant

      Authors: Vishnu Dutt Duggirala, Rhys Sean Butler, Farnoush Banaei-Kashani

      Publication: CSEDU

      Category: Conference

      Link: TBA

    Abstract: Designed and implemented a question-answering chatbot, dubbed iTA (intelligent Teaching Assistant), which can provide detailed answers to questions by effectively identifying the most relevant answers in “long” text sources (documents or textbooks). iTA answers questions by implementing a two-stage procedure. First, the topmost relevant paragraphs are identified in the selected text source using a retrieval-based approach and scores for the retrieved paragraphs are computed. Second, using a generative model, extracted the relevant content from the top-ranked paragraph to generate the answer. Our results show that iTA is well suited to generate meaningful answers for questions posed by students.

  • 2021 - A Scalable Role Mining Approach for Large Organizations

      Authors: Masoumeh Abolfathi, Zohreh Raghebi, Jafar Haadi Jafarian, Farnoush Banaei-Kashani

      Publication: IWSPA

      Category: Workshop

      Link: TBA

    Abstract: TBA

  • 2020 - Multi-Modal Classification for Polarization Intent Detection in Social Media

      Authors: Tobby Lie, Haadi Jafarian, Stephen Hartnett, Hamilton Bean and Farnoush Banaei-Kashani

      Publication: KDD TrueFact 2020

      Category: Workshop

      Link: TBA

    Abstract: The ease of access to social media has made it extremely easy for malicious agents to disseminate divisive information on a mass scale. With the widespread reach of messages with various intents it becomes more difficult for experts to analyze such information in a timely and accurate manner to detect these intents. Within the research community, capturing online threats of this nature has been a growing area of interest. In this paper, we develop a methodology to detect intention of social polarization information shared by a single agent in social media. Toward this end, we use and study the Internet Agency Facebook ads dataset released by U.S. House Intelligence Committee. We train and evaluate a series of models for text- and image-based intention detection and show that a multi-modal model that uses both data modalities outperforms other unimodal models.

  • 2020 - Painometry: Wearable and Objective Quantification System for Acute Postoperative Pain

      Authors: H. Truong, N. Bui, Z. Raghebi, M. Ceko, N. Pham, P. Nguyen, A. Nguyen, T. Kim, K. Siegfried, E. Stene, T. Tvrdy, L. Weinman, T. Payne, D. Burke, T. Dinh, S. D'Mello, F. Banaei-Kashani, T. Wager, P. Goldstein, T. Vu

      Publication: MobiSys 2020

      Category: Conference

      Link: TBA

    Abstract: TBA

  • 2020 - WAKE: A Behind-the-ear Wearable System for Microsleep Detection

      Authors: . Truong, T. Dinh, Z. Raghebi, T. Kim, N. Bui, P. Nguyen, H. Truong, F. Banaei-Kashani, A. Halbower, T. N. Dinh, T. Vu

      Publication: MobiSys 2020

      Category: Conference

      Link: TBA

    Abstract: TBA

  • 2020 - Multiscale Frequent Co-movement Pattern Mining

      Authors: Shahab Helmi, Farnoush Banaei-Kashani

      Publication: ICDE 2020

      Category: Conference

      Link: TBA

    Abstract: Thanks to recent prevalence of location tracking technologies, collecting massive spatiotemporal datasets containing moving object trajectories has become possible, providing an exceptional opportunity to derive interesting insights about the behavior of moving objects such as people, animals, and vehicles. In particular, mining patterns from "co-movements" of objects (such as movements by players of a sports team, joints of the human body while walking, and vehicles in a transportation network) can lead to the discovery of interesting patterns (e.g., offense tactics of a sports team, gait signature of a person, and driving behaviors causing heavy traffic). Various trajectory mining and frequent pattern mining techniques have been proposed to discover patterns in trajectory datasets and more generally, event sequences. However, existing approaches are inapplicable for co-movement pattern mining from multi-trajectory datasets. In this paper, we propose a novel and efficient framework for co-movement pattern mining. We also extend this framework for efficient mining of such patterns at multiple spatial scales. The performance of the proposed solutions is evaluated by conducting extensive experiments using two real datasets, a soccer game dataset and a human gait dataset. Our experimental results show that our proposed algorithms are promising.

  • 2019 - SeAlM: A Query Cache Optimization Technique for Next Generation Sequence Alignment

      Authors: Evan Stene, Farnoush Banaei-Kashani

      Publication:DMBIH 2019

      Category: Workshop

      Link: TBA

    Abstract: Genetic data from next-generation sequencing (NGS) technology is being produced at an ever increasing rate - already outpacing the well known Moore’s Law. Due to this pace of NGS data generation, new methods are necessary in order to facilitate rapid sequence analysis at the enormous scale required. The need for such methods is further compounded by the dropping financial cost of sequencing, leading to the normalization of large-scale genome studies spanning entire populations. A key process in the genomic data analysis pipeline, and one that is often most time consuming, is read mapping or so-called alignment. This paper introduces Sequence Alignment Memorizer (SeAlM), a technique that reduces the number of redundant alignments to enable population-scale workloads. SeAlM uses a novel method for reordering alignment queries from multiple sources to create batches with increased likelihood of containing redundant queries that can be de- duplicated before alignment, while also ordering those batches to improve the ability to cache queries effectively. We show that our technique can improve the average throughput of alignment for a single human sample by 6.5% and a population of 10 human subjects by 13.6% -18.8% depending on the type of genetic data used.

  • 2019 - Using similarity measures for medical event sequences to predict mortality in trauma patients

      Authors: Joel Fredrickson, Michael Mannino, Omar Alqahtani, Farnoush Banaei-Kashani

      Publication: Decision Support Systems

      Category: Journal

      Link: ScienceDirect

    Abstract: We extend a similarity measure for medical event sequences (MESs) and evaluate its classification performance for retrospective mortality prediction of trauma patient outcomes. Retrospective mortality prediction is a benchmarking task used by trauma care governance bodies to assist with policy decisions. We extend the similarity measure, the Optimal Temporal Common Subsequence for MESs (OTCS-MES), by generalizing the event-matching component with a plug-in weighting element. The extended OTCS-MES uses an event prevalence weight developed in our previous study and an event severity weight developed for this study. In the empirical evaluation of classification performance, we provide a more complete evaluation than previous studies. We compare the predictive performance of the Trauma Mortality Prediction Model (TMPM), an accepted regression approach for mortality prediction in trauma data, to nearest neighbor algorithms using similarity measures for MESs. Using a data set from the National Trauma Data Bank, our results indicate improved predictive performance for an ensemble of nearest neighbor classifiers over TMPM. Our analysis reveals a superior Receiver Operating Characteristics (ROC) curve, larger AUC, and improved operating points on a ROC curve. We also study methods to adjust for uncommon class prediction: weighted voting, neighborhood size, and case base size. Results provide strong evidence that similarity measures for medical event sequences are a powerful and easily adapted method assisting with health care policy advances.

  • 2019 - Toward System-Optimal Route Guidance

      Authors: Robert J. Fitzgerald, Farnoush Banaei-Kashani

      Publication: IEEE MDM 2019

      Category: Conference

      Link: TBA

    Abstract: The existing online mapping systems process many user route queries simultaneously, yet solve each independently, using typical route guidance solutions. These route recommendations are presented as optimal, but often this is not truly the case, due to the effects of competition users experience over the resulting experienced routes, a phenomenon referred to in Game Theory as a Nash Equilibrium. Additionally, route plans of this nature can result in poor utilization of the road network from a system-optimizing perspective as well. In this paper, we introduce an enhanced approach for route guidance, motivated by the relevance of a system optimal equilibrium strategy, while also maintaining some fairness to the individual. With this approach the objective is to optimize the global road network utilization (as measured by, e.g., mobility, or global emissions) by selecting from a set of generally fair user route alternatives in a batch setting.
    For the first time, we present an approximate, anytime algorithm based on Monte Carlo Tree Search and Eppstein’s Top-K Shortest Paths algorithm to solve this complex dual optimization problem in real-time. This approach attempts to identify and avoid the potentially harmful network effects of sub-optimal route combinations. Experiments show that mobility optimization over real road networks of Rye and Golden, Colorado in a microscopic traffic simulation with a network congestion-minimizing objective can achieve considerable mobility improvement for users, as observed by their effective travel time improvement up to 12% with some consideration of route fairness.

  • 2018 - Device-measured Physical Activity Data for Classification of Patients with Ventricular Arrhythmia Events: A Pilot Investigation

      Authors: Lucas Marzec, Sridharan Raghavan, Farnoush Banaei-Kashani, Seth Creasy, Edward L. Melanson, Leslie Lange, Debashis Ghosh, Michael A. Rosenberg

      Publication: PLoS ONE

      Category: Journal

      Link: PloS ONE

    Abstract: Low levels of physical activity are associated with increased mortality risk, especially in cardiac patients, but most studies are based on self-report. Cardiac implantable electronic devices (CIEDs) offer an opportunity to collect data for longer periods of time. However, there is limited agreement on the best approaches for quantification of activity measures due to the time series nature of the data. We examined physical activity time series data from 235 subjects with CIEDs and at least 365 days of uninterrupted measures. Summary statistics for raw daily physical activity (minutes/day), including statistical moments (e.g., mean, standard deviation, skewness, kurtosis), time series regression coefficients, frequency domain components, and forecasted predicted values, were calculated for each individual, and used to predict occurrence of ventricular tachycardia (VT) events as recorded by the device. In unsupervised analyses using principal component analysis, we found that while certain features tended to cluster near each other, most provided a reasonable spread across activity space without a large degree of redundancy. In supervised analyses, we found several features that were associated with the outcome (P < 0.05) in univariable and multivariable approaches, but few were consistent across models. Using a machine-learning approach in which the data was split into training and testing sets, and models ranging in complexity from simple univariable logistic regression to ensemble decision trees were fit, there was no improvement in classification of risk over naïve methods for any approach. Although standard approaches identified summary features of physical activity data that were correlated with risk of VT, machine-learning approaches found that none of these features provided an improvement in classification. Future studies are needed to explore and validate methods for feature extraction and machine learning in classification of VT risk based on device-measured activity.

  • 2018 - LIBS: A Bioelectrical Sensing System from Human Ears for Staging Whole-Night Sleep Study

      Authors: Anh Nguyen, Raghda Alqurashi, Zohreh Raghebi, Farnoush Banaei-Kashani, Ann C. Halbower, Tam Vu

      Publication: CACM

      Category: Journal

      Link: ACM Digital Library

    Abstract: Sensing physiological signals from the human head has long been used for medical diagnosis, human-computer interaction, meditation quality monitoring, among others. However, existing sensing techniques are cumbersome and not desirable for long-term studies and impractical for daily use. Due to these limitations, we explore a new form of wearable systems, called LIBS, that can continuously record biosignals such as brain wave, eye movements, and facial muscle contractions, with high sensitivity and reliability. Specifically, instead of placing numerous electrodes around the head, LIBS uses a minimal number of custom-built electrodes to record the biosignals from human ear canals. This recording is a combination of three signals of interest and unwanted noise. Therefore, we design an algorithm using a supervised Nonnegative Matrix Factorization (NMF) model to split the single-channel mixed signal into three individual signals representing electrical brain activities (EEG), eye movements (EOG), and muscle contractions (EMG). Through prototyping and implementation over a 30 day sleep experiment conducted on eight participants, our results prove the feasibility of concurrently extracting separated brain, eye, and muscle signals for fine-grained sleep staging with more than 95% accuracy. With this ability to separate the three biosignals without loss of their physiological information, LIBS has a potential to become a fundamental in-ear biosensing technology solving problems ranging from self-caring health to non-health and enabling a new form of human communication interfaces.

  • 2018 - Mortality Prediction Performance using Similarity Measures for Medical Event Sequences

      Authors: Joel Fredrickson, Michael Mannino, Omar Alqahtani, Farnoush Banaei-Kashani

      Publication: Decision Support Systems

      Category: Conference

      Link: AIS eLibrary

    Abstract: We extend a similarity measure for medical event sequences (MESs) and evaluate its performance on mortality prediction using a substantial trauma data set. We extend the Optimal Temporal Common Subsequence for MESs (OTCS-MES) measure by generalizing the event-matching component with user-defined weights. In the empirical evaluation of classification performance, we provide a more complete evaluation than previous studies. We compare the predictive performance of the Trauma Mortality Prediction Model (TMPM), an accepted regression approach for mortality prediction in trauma data, to nearest neighbor algorithms using similarity measures for MESs. Using a data set from the National Trauma Data Bank, our results indicate improved predictive performance for an ensemble of nearest neighbor classifiers over TMPM. Our analysis demonstrates a superior Receiver Operating Characteristics (ROC) curve, larger AUC, and improved operating points on a ROC curve. Predictive performance improves for the ensemble for a variety of sensitivity weights and false positive constraints.

  • 2018 - Reach Me If You Can! Reachability Query in Uncertain Contact Networks

      Authors: Z. Raghebi, F. Banaei-Kashani

      Publication: GeoRich '18, SIGMOD

      Category: Workshop

      Link: TBA

    Abstract: With the advent of reliable positioning technologies and prevalence of location-based services, it is now feasible to accurately study the propagation of items such as infectious viruses, sensitive information pieces, and malwares through a population of moving objects, e.g., individuals, vehicles, and mobile devices. In such application scenarios, an item passes between two objects when the objects are sufficiently close (i.e., when they are, so-called, in contact), and hence once an item is initiated, it can propagate in the object population through the evolving network of contacts among objects, termed contact network. In this paper, for the first time we define and study probabilistic reachability queries in large uncertain contact networks, where propagation of items through contacts are uncertain. A probabilistic reachability query verifies whether two objects are “reachable” through the evolving uncertain contact network with a probability greater than a threshold η. For efficient processing of probabilistic queries, we propose a novel index structure, termed spatiotemporal tree cover (STC), which leverages the spatiotemporal properties of the contact network for efficient processing of the queries. Our experiments with real data demonstrate superiority of our proposed solution versus the only other existing solution (based on Monte Carlo sampling) for processing probabilistic reachability queries in generic uncertain graphs, with 300% improvement in query processing time on average.

  • 2018 - Enabling Prediction for Optimal Fuel Economy Vehicle Control

      Authors: Zachary D. Asher, Jordan A. Tunnell, David A. Baker, Robert J. Fitzgerald, Farnoush Banaei-Kashani, Sudeep Pasricha, Thomas H. Bradley

      Publication: SAE International

      Category: Technical Paper

      Link: SAE International

    Abstract: Vehicle control using prediction based optimal energy management has been demonstrated to achieve better fuel economy resulting in economic, environmental, and societal benefits. However, research focusing on prediction derivation for use in optimal energy management is limited despite the existence of hundreds of optimal energy management research papers published in the last decade. In this work, multiple data sources are used as inputs to derive a prediction for use in optimal energy management. Data sources include previous drive cycle information, current vehicle state, the global positioning system, travel time data, and an advanced driver assistance system (ADAS) that can identify vehicles, signs, and traffic lights. To derive the prediction, the data inputs are used in a nonlinear autoregressive artificial neural network with external inputs (NARX). Two real world drive cycles were developed for analysis in the Denver, Colorado region: a city-focused drive cycle that passes through downtown as well as a highway-focused drive cycle that transitions across multiple interstates. A validated model of a 2010 Toyota Prius in Autonomie is used to determine the vehicle control fuel economy improvements that are possible from the NARX prediction. The optimal energy management control strategy is determined using dynamic programming due to its ease of use and that the solution produced is the globally optimal solution. The control strategies compared include the existing 2010 Toyota Prius control strategy as a baseline, the neural network prediction optimal energy management control strategy, and a 100% accurate prediction optimal energy management control strategy. Results show that inclusion of various sensors and signals enables a significant amount of the fuel economy improvement with respect to 100% accurate prediction. The conclusion is that prediction based optimal energy management enabled fuel economy improvements can be realized with currently available sensors and signals.

  • 2018 - Probabilistic reachability query in evolving spatiotemporal contact networks of moving objects

      Authors: Z. Raghebi, F. Banaei-Kashani

      Publication: ACM SIGSPATIAL

      Category: Short Paper

      Link: ACM Digital Library

    Abstract: With the rapid development of location sensors, it is now possible to study how various items (such as viruses and messages) spread across populations of moving objects at scale. In such applications, two objects are considered in-contact while they are sufficiently close to each other. Such a dynamic network of objects, so-called a "contact network". In this paper, we define and study probabilistic reachability queries in uncertain contact networks, where contacts between objects are probabilistic. A probabilistic reachability query verifies whether two objects are "reachable" with a probability no less than a threshold η. We introduce Optimized Spatiotemporal Tree Cover, an index structure that leverages the spatiotemporal properties of the contact network to enable efficient processing of the reachability queries on large uncertain contact networks. With an extensive study using both real and synthetic datasets, we demonstrate superiority of our proposed solution versus a baseline solution (i.e., Monte Carlo Sampling) and the only other existing solution for reachability queries on uncertain contact networks, with 350% and 150% improvement in query processing time on average, respectively.

  • 2018 - Efficient Processing of Probabilistic Single and Batch Reachability Queries in Large and Evolving Spatiotemporal Contact Networks

      Authors: Z. Raghebi, F. Banaei-Kashani

      Publication: IEEE BigData

      Category: Short Paper

      Link: TBA

    Abstract: With the rapid development of location sensors, it is now possible to accurately study how various items (such as viruses or messages) spread across populations of moving objects. In such applications, an item can propagate through the object population where two objects are close. Such a dynamic network of objects called a “contact network”. In this paper, we define and study a family of probabilistic reachability queries in uncertain contact networks, where contacts between objects are probabilistic. A probabilistic reachability query verifies whether two objects are “reachable” with a probability no less than a threshold. To enable efficient processing of probabilistic reachability queries on large uncertain contact networks, first, we present a series-parallel reduction technique that significantly reduces the size of the input uncertain contact network in order to shrink the search space while maintaining accuracy and second, we introduce Optimized Spatiotemporal Tree Cover, an index structure that leverages the spatiotemporal properties of the contact network. With an extensive analytical and empirical study, we demonstrate superiority of our proposed solution versus a baseline solution (i.e., Monte Carlo sampling) and the only other existing solution with 400% and 200% improvement in query processing time on average, respectively.

  • 2017 - Spatiotemporal Range Pattern Queries on Large-scale Co-movement Pattern Datasets

      Authors: Shahab Helmi, F. Banaei-Kashani

      Publication: IEEE BigData '17

      Category: Conference

      Acceptance Rate: 18%

      Link: IEEE

    Abstract: Thanks to recent prevalence of location sensors, collecting massive spatiotemporal datasets containing moving object trajectories has become possible, providing an exceptional opportunity to derive interesting insights about behavior of the moving objects such as people, animals, and vehicles. In particular, mining patterns from co-movements of objects (such as players of a sports team, joints of a person while walking, and cars in a transportation network) can lead to the discovery of interesting patterns (e.g., offense tactics of the sports team, gait signature of the person, and driving behaviors causing heavy traffic). With our prior work, we proposed efficient algorithms to mine frequent co-movement patterns from trajectory datasets. In this paper, we focus on the problem of efficient query processing on massive co-movement pattern datasets generated by such pattern mining algorithms. Given a dataset of frequent co-movement patterns, various spatiotemporal queries can be posed to retrieve relevant patterns among all generated patterns from the pattern dataset. We term such queries “pattern queries”. Co-movement patterns are often numerous due to combinatorial complexity of such patterns, and therefore, co-movement pattern datasets grow very large, rendering naive execution of the pattern queries ineffective. In this paper, we propose novel index structures and query processing algorithms for efficient answering of two families of range pattern queries on massive co-movement pattern datasets, namely, spatial range pattern queries and temporal range pattern queries. Our extensive empirical studies with three real datasets have demonstrated the efficiency of the proposed methods.

  • 2017 - Development and Evaluation of a Similarity Measure for Medical Event Sequences

      Authors: Michael Mannino, Joel Fredrickson, Farnoush Banaei-Kashani, Iris Linck and Raghda Alqurashi

      Publication: ACM Transactions on Management Information Systems (TMIS), Vol 8, Issue 2, 2017.

      Category: Journal

      Link: ACM Digital Library

  • 2017 - Efficient Maximal Reverse Skyline Query Processing

      Authors: F. Banaei-Kashani, P. Ghaemi, B. Movaqar, S. J. Kazemitabar

      Publication: GeoInformatica (2017): 1-24

      Category: Journal

      Link: ACM Digital Library

    Abstract: Given a set S of sites and a set O of objects in a metric space, the Optimal Location (OL) problem is about computing a location in the space where introducing a new site (e.g., a retail store) maximizes the number of the objects (e.g., customers) that would choose the new site as their “preferred” site among all sites. However, the existing solutions for the optimal location problem assume that there is only one criterion to determine the preferred site for each object, whereas with numerous real-world applications multiple criteria are used as preference measures. For example, while a single criterion solution might consider the metric distance between the customers and the retail store as the preference measure, a multi-criteria solution might consider the annual membership cost as well as the distance to the retail store to find an optimal location. In this paper, for the first time we develop an efficient and exact solution for the so-called Multi-Criteria Optimal Location (MCOL) problem that can scale with large datasets. Toward that end, first we formalize the MCOL problem as maximal reverse skyline query (MaxRSKY). Given a set of sites and a set of objects in a d-dimensional space, MaxRSKY query returns a location in the space where if a new site s is introduced, the size of the (bichromatic) reverse skyline set of s is maximal. To the best of our knowledge, this paper is the first to define and study MaxRSKY query. Accordingly, we propose a filter-based solution, termed EF-MaxRSKY, that effectively prunes the search space for efficient identification of the optimal location. Our extensive empirical analysis with both real and synthetic datasets show that EF-MaxRSKY is invariably efficient in computing answers for MaxRSKY queries with large datasets containing thousands of sites and objects.

  • 2017 - Efficient Processing of Spatiotemporal Pattern Queries on Historical Frequent Co-Movement Pattern Datasets

      Authors: S. Helmi, F. Banaei-Kashani

      Publication: MATES '17, VLDB

      Category: Workshop

      Link: Springer

    Abstract: Thanks to recent prevalence of location sensors, collecting massive spatiotemporal datasets containing moving object trajectories has become possible, providing an exceptional opportunity to derive interesting insights about behavior of the moving objects such as people, animals, and vehicles. In particular, mining patterns from co-movements of objects (such as movements by players of a sports team, joints of a person while walking, and cars in a transportation network) can lead to the discovery of interesting patterns (e.g., offense tactics of a sports team, gait signature of a person, and driving behaviors causing heavy traffic). Given a dataset of frequent co-movement patterns, various conventional spatial and spatiotemporal queries can be posed to retrieve relevant patterns among all generated patterns from the dataset. We term such queries, pattern queries. Co-movement patterns are often numerous due to combinatorial complexity of such patterns, and therefore, co-movement pattern datasets often grow very large in size, rendering naive execution of the pattern queries ineffective. In this paper, we propose the CPI framework, which offers a variety of index structures for efficient answering of various range pattern queries on massive co-movement pattern datasets, namely, spatial range pattern queries, temporal range (time-slice) pattern queries, and spatiotemporal range pattern queries.

  • 2017 - MVSC-Bench: A Tool to Benchmark Classification Methods for Multivariate Spatiotemporal (demo paper)

      Authors: S. V. Kulkarni, F. Banaei-Kashani

      Publication: SSTD '17

      Category: Conference

      Link: PDF

    Abstract: Applications focusing on analysis of multivariate spatiotemporal series (MVS) have proliferated over the past decade. Researchers in a wide array of domains ranging from action recognition to sports analytics have come forward with novel methods to classify this type of data, but well-defined benchmarks for comparative evaluation of the MVS classification methods are non-existent. We present MVSC-Bench, to target this gap.

  • 2016 - Mining Frequent Episodes from Multivariate Spatiotemporal Event Sequences

      Authors: S. Helmi, F. Banaei-Kashani

      Publication: IWGS '16 Proceedings of the 7th ACM SIGSPATIAL International Workshop on GeoStreaming

      Category: Workshop

      Link: ACM Digital Library

    Abstract: Thanks to recent prevalence of location sensors, collecting massive spatiotemporal datasets containing moving object trajectories has become possible, providing an exceptional opportunity to derive interesting insights about behavior of the moving objects such as people, animals and vehicles. In particular, mining patterns from interdependent co-movements of objects in a group/team (such as players of a sports team, ants of a colony in search of food, and cars in a congested downtown district) can lead to the discovery of interesting patterns (e.g., offense tactics and strategies of a sports team). Various trajectory mining, and in particular frequent episode mining (FEM), approaches have been proposed to discover such patterns from trajectory datasets. However, the existing FEM approaches neither are applicable to multivariate spatial (MVS) event sequences nor consider and leverage all spatial features of the input data. In this paper, we first introduce a Spatial Apriori property which extends the well-known Apriori property to consider the spatial properties of the input data. We present a data preprocessing technique that leverages the aforementioned Spatial Apriori to reduce the search space of our problem by filtering out irrelevant events from a given MVS event sequence. Second, we present the MVS-FEM framework which efficiently discovers co-movements patterns from MVS datasets. The efficiency of our proposed solutions is evaluated using a real dataset.

  • 2016 - A Lightweight And Inexpensive In-ear Sensing System For Automatic Whole-night Sleep Stage Monitoring

      Authors: Nguyen, A., Alqurashi, R., Raghebi, Z., Banaei-Kashani, F., Halbower, A. C., & Vu, T.

      Publication: SenSys '16 Proceedings of the 14th ACM Conference on Embedded Network Sensor System

      Category: Conference

      Link: ACM Digital Library

      21 out of 119 submissions, acceptance ratio: 17.6%)

      Received The Best Paper Award

    Abstract: This paper introduces LIBS, a light-weight and inexpensive wearable sensing system, that can capture electrical activities of human brain, eyes, and facial muscles with two pairs of custom-built flexible electrodes each of which is embedded on an off-the-shelf foam earplug. A supervised non-negative matrix factorization algorithm to adaptively analyze and extract these bioelectrical signals from a single mixed in-ear channel collected by the sensor is also proposed. While LIBS can enable a wide class of low-cost self-care, human computer interaction, and health monitoring applications, we demonstrate its medical potential by developing an autonomous whole-night sleep staging system utilizing LIBS's outputs. We constructed a hardware prototype from off-the-shelf electronic components and used it to conduct 38 hours of sleep studies on 8 participants over a period of 30 days. Our evaluation results show that LIBS can monitor biosignals representing brain activities, eye movements, and muscle contractions with excellent fidelity such that it can be used for sleep stage classification with an average of more than 95% accuracy.

  • 2016 - LIBS: A Low-Cost In-Ear Bioelectrical Sensing Solution for Healthcare Applications

      Authors: Nguyen, A., Raghebi, Z., Banaei-Kashani, F., Halbower, A. C., & Vu, T.

      Publication: Proceedings of the Eighth Wireless of the Students, by the Students, and for the Students Workshop

      Category: Workshop

      Link: ACM Digital Library

    Abstract: Bioelectrical signals representing electrical activities of human brain, eyes, and facial muscles have found widespread use both as important inputs for critical medical issues and as an invisible communication pathway between human and external devices. However, existing techniques for measuring those biosignals require attaching electrodes on the face and do not come in handy sizes for daily usage. Additionally, no study has been capable of providing all three biosignals with high fidelity simultaneously. In this paper, we present a low-cost bioelectrical sensing system, called LIBS, that can robustly collect the biosignal of good quality from inside human ears and extract all those three fundamental biosignals without loss of information. The practicality of LIBS is shown through one real world scenario of a sleep quality monitoring system. Based on preliminary results, we further propose potential healthcare applications utilizing the sensor's outputs for our future research.

  • 2016 - In-ear Biosignal Recording System: A Wearable For Automatic Whole-night Sleep Staging

      Authors: Nguyen, A., Alqurashi, R., Raghebi, Z., Banaei-Kashani, F., Halbower, A. C., Dinh, T., & Vu, T.

      Publication: Proceedings of the 2016 Workshop on Wearable Systems and Applications

      Category: Workshop

      Link: ACM Digital Library

    Abstract: In this work, we present a low-cost and light-weight wearable sensing system that can monitor bioelectrical signals generated by electrically active tissues across the brain, the eyes, and the facial muscles from inside human ears. Our work presents two key aspects of the sensing, which include the construction of electrodes and the extraction of these biosignals using a supervised non-negative matrix factorization learning algorithm. To illustrate the usefulness of the system, we developed an autonomous sleep staging system using the output of our proposed in-ear sensing system. We prototyped the device and evaluated its sleep stage classification performance on 8 participants for a period of 1 month. With 94% accuracy on average, the evaluation results show that our wearable sensing system is promising to monitor brain, eyes, and facial muscle signals with reasonable fidelity from human ear canals.

  • 2015 - Development and Evaluation of Similarity Measures for Medical Event Sequences

      Authors: Michael Mannino, Joel Fredrickson, Iris Linck, Raghda Alqurashi and Farnoush Banaei-Kashani

      Publication: Workshop on Information Technologies and Systems (WITS 2015), Dallas, Texas, 2015

      Category: Workshop

      More: Click Here

  • 2014 - Janus - Multi Source Event Detection and Collection System for Effective Surveillance of Criminal Activity

      Authors: Farnoush Banaei-Kashani, Cyrus Shahabi, Seon Ho Kim, Luciano Nocera, Giorgos Constantinou, Ying Lu, Yinghao Cai, Gérard G. Medioni, Ramakant Nevatia

      Publication: Journal of Information Processing Systems (JIPS), Vol 10, No 1, pp. 1-22

      Category: Journal

      More: Click Here

  • 2014 - A Comparative Study of Two Approaches for Supporting Optimal Network Location Queries

      Authors: P. Ghaemi , K. Shahabi, J. Wilson, F. Banaei-Kashani

      Publication: GeoInformatica, Vol. 18, Issue 2

      Category: Journal

      Download: PDF

  • 2013 - Efficient Batch Processing of Proximity Queries by Optimized Probing

      Authors: Seyed Jalal Kazemitabar, F. Banaei-Kashani, Seyed Jalil Kazemitabar, Dennis McLeod

      Publication: ACMGIS

      Category: Conference

      Download: PDF

  • 2013 - Monitoring Mobility Disorders at Home using 3D Visual Sensors and Mobile Sensors (demo paper)

      Ahthor(s): F. Banaei-Kashani et al.

      Publication: Wireless Health

      Category: Conference

      Download: PDF

  • 2013 - SDPF: A Framework for Online, Real-time Cleansing of Upstream Operating Data

      Authors: F. Banaei-Kashani, M. Asghari, M Rahmani, C. Shahabi, Lisa Brenskelle

      Publication: SPE Western Regional Meeting

      Category: Conference

      Download:

  • 2013 - Users Plan Optimization for Participatory Urban Texture Documentation

      Authors: H. Shirani-Mehr, F. Banaei-Kashani and C. Shahabi

      Publication: GeoInformatica, Vol. 17, Issue 1

      Category: Conference

      Download: PDF

  • 2012 - Continuous Maximal Reverse Nearest Query on Spatial Networks

      Authors: P. Ghaemi , K. Shahabi, J. Wilson, F. Banaei-Kashani

      Publication: ACMGIS

      Category: Conference

      Download: PDF

  • 2012 - Reachability Query in Large Evolving Contact Networks

      Authors: H. Shirani-Mehr, F. Banaei-Kashani and C. Shahabi

      Publication: VLDB

      Category: Conference

      Download: PDF

  • 2011 - A Case Study of Participatory Data Transfer for Urban Temperature Monitoring

      Authors: H. Shirani-Mehr , F. Banaei-Kashani and C. Shahabi

      Publication: W2GIS

      Category: Workshop

      Download: PDF

  • 2011 - Discovering Traffic Patterns in Traffic Sensor Data

      Authors: F. Banaei-Kashani, C. Shahabi and B. Pan

      Publication: IWGS

      Category: Workshop

      Download: PDF

  • 2011 - Online Computation of Fastest Path in Time-Dependent Spatial Networks

      Authors: U. Demiryurek, F. Banaei-Kashani, C. Shahabi and Anand Ranganathan

      Publication: SSTD

      Category: Conference

      Download: PDF

  • 2010 - A Case for Time-Dependent Shortest Path Computation in Spatial Networks

      Authors: U. Demiryurek, F. Banaei-Kashani and C. Shahabi

      Publication: ACMGIS

      Category: Conference

      Download: PDF

  • 2010 - Efficient Approximate Visibility Query in Large Dynamic Environments

      Authors: L. Kazemi, F. Banaei-Kashani, C. Shahabi and R. Jain

      Publication: DASFAA

      Category: Conference

      Download: PDF

  • 2010 - Efficient K-Nearest Neighbor Search in Time-Dependent Spatial Networks

      Authors: U. Demiryurek, F. Banaei-Kashani and C. Shahabi

      Publication: DEXA

      Category: Conference

      Download: PDF

  • 2010 - Fixed-precision approximate continuous aggregate queries in peer-to-peer databases

      Authors: F. Banaei-Kashani and C.Shahabi

      Publication: CollaborateCom

      Category: Conference

      Download: PDF

  • 2010 - GeoDec: A framework to effectively visualize and query geospatial data for decision-making

      Authors: C. Shahabi, F. Banaei Kashani, A. Khoshgozaran and S. Xing

      Publication: IEEE Multimedia Magazine

      Category: Journal

      Download: PDF

  • 2010 - GeoSIM: A GeoSpatial Data Collection System for Participatory Urban Texture Documentation

      Authors: F. Banaei-Kashani, H. Shirani-Mehr, B. Pan, N. Bopp, L. Nocera, C. Shahabi

      Publication: Special Issue of IEEE Data Engineering Bulletin on Spatial and Spatiotemporal Databases

      Category: Journal

      Download: PDF

  • 2010 - Optimal Network Location Queries

      Authors: P. Ghaemi , K. Shahabi, J. Wilson, F. Banaei-Kashani

      Publication: ACMGIS

      Category: Conference

      Download: PDF

  • 2010 - Spatiotemporal Summarization of Traffic Data Streams

      Authors: B. Pan, U. Demiryurek, F. Banaei-Kashani and C. Shahabi

      Publication: IWGS in conjunction with ACMGIS

      Category: Workshop

      Download: PDF

  • 2010 - TransDec: A Spatiotemporal Query Processing Framework for Transportation Systems

      Authors: U. Demiryurek, F. Banaei-Kashani and C. Shahabi

      Publication: ICDE

      Category: Conference

      Download: PDF

  • 2010 - Using Location-based Social Networks for Quality-Aware Participatory Data Transfer

      Authors: H. Shirani-Mehr , F. Banaei-Kashani and C. Shahabi

      Publication: ACMGIS

      Category: Conference

      Download: PDF

  • 2010 - Voronoi-based Geospatial Query Processing with MapReduce

      Authors: A. Akdogan, U. Demiryurek, F. Banaei-Kashani and C. Shahabi

      Publication: IEEE CloudCom

      Category: Conference

      Download: PDF

  • 2009 - Applications of sensor network data management

      Authors: F. Banaei-Kashani and C. Shahabi

      Publication: Springer

      Category: Book Chapter

      Download: PDF

  • 2009 - Case study: Scoop up data from peer-to-peer databases, Book chapter in Handbook of Peer-to-Peer Networks

      Authors: F. Banaei-Kashani and C. Shahabi

      Publication: Springer

      Category: Book Chapter

      Download: PDF

  • 2009 - Efficient Viewpoint Assignment for Urban Texture Documentation

      Authors: H. Shirani-Mehr, F. Banaei-Kashani and C. Shahabi

      Publication: ACMGIS

      Category: Conference

      Download: PDF

  • 2009 - Efficient Viewpoint Selection for Urban Texture Documentationor query in spatial networks using Euclidian Restriction

      Authors: H. Shirani-Mehr, F. Banaei-Kashani and C. Shahabi

      Publication: GSN

      Category: Conference

      Download: PDF

  • 2009 - ER-CkNN: Efficient continuous nearest neighbor query in spatial networks using Euclidian Restriction

      Authors: U. Demiryurek, F. Banaei-Kashani and C. Shahabi

      Publication: SSTD

      Category: Conference

      Download: PDF

  • 2009 - GeoDec: A Multi-Layered Query Processing Framework for Spatiotemporal Data

      Authors: L. Nocera, A. Rihan, S. Xing, A. Khodaei, A. Khoshgozaran, C. Shahabi, F. Banaei-Kashani

      Publication: ACMGIS

      Category: Conference

      Download: PDF

  • 2009 - Temporal Modeling of Spatiotemporal Networks

      Authors: U. Demiryurek, B. Pan, F. Banaei-Kashani and C. Shahabi

      Publication: IWCTS in conjunction with ACMGIS

      Category: Conference

      Download: PDF

  • 2008 - Fixed-precision approximate continuous aggregate queries in peer-to-peer databases

      Authors: F. Banaei-Kashani and C. Shahabi

      Publication: ICDE

      Category: Conference

      Download: PDF

  • 2008 - On-the-fly Visualization of Scientific Geospatial Data Using Waveltes

      Authors: C. Shahabi, F. Banaei-Kashani and K. Song

      Publication: Microsoft e-Science Workshop

      Workshop: Conference

      Download: PDF

  • 2008 - Partial read from peer-to-peer databasesJournal of Computer Communications, Vol. 31, No. 2

      Authors: F. Banaei-Kashani and C. Shahabi

      Publication: Journal of Computer Communications, Vol. 31, No. 2

      Category: Journal

      Download: PDF

  • 2008 - ProDA: An end-to-end wavelet-based OLAP system for massive datasets

      Authors: C. Shahabi, M. Jahangiri and F. Banaei-Kashani

      Publication: IEEE Computer, Vol. 41, No. 4

      Category: Journal

      Download: PDF

  • 2008 - SWAM: A family of access methods for similarity-search in peer-to-peer data networks

      Authors: C. Shahabi, M. Jahangiri and F. Banaei-Kashani

      Publication: IEEE Computer, Vol. 41, No. 4

      Category: Conference

      Download: PDF

  • 2007 - Modeling peer-to-peer data networks under complex system theory

      Authors: C. Shahabi and F. Banaei-Kashani

      Publication: International Journal of Computational Science and Engineering (IJCSE)

      Category: Journal

      Download: PDF

  • 2006 - Partial selection query in peer-to-peer databases

      Authors: C. Shahabi and F. Banaei-Kashani

      Publication: ICDE

      Category: Conference

      Download: PDF

  • 2004 - WSPDS: Web Services Peer-to-peer Discovery Service

      Authors: F. Banaei-Kashani, C. Chen and C. Shahabi

      Publication: ISWS

      Category: Conference

      Download: PDF

  • 2003 - Criticality-based analysis and design of unstructured peer-to-peer networks as complex systems

      Authors: F. Banaei-Kashani, C. Chen and C. Shahabi

      Publication: 3rd International Workshop on Global and Peer-to-Peer Computing (GP2PC) in conjunction with CC-Grid

      Category: Workshop

      Download: PDF

  • 2003 - Efficient and anonymous web usage mining for web personalization

      Authors: C. Shahabi and F. Banaei-Kashani

      Publication: INFORMS Journal on Computing Special Issue on Mining Web-Based Data for e-Business Applications, Vol. 15, No. 2, Spring

      Category: Journal

      Download: PDF

  • 2003 - Efficient flooding in power-law networks

      Authors: F. Banaei-Kashani and C. Shahabi

      Publication: PODC

      Category: Conference

      Download: PDF

  • 2003 - Searchable querical data networks

      Authors: F. Banaei-Kashani and C. Shahabi

      Publication: International Workshop on Databases, Information Systems and Peer-to-Peer Computing (DBISP2P) in con

      Category: Workshop

      Download: PDF

  • 2002 - Decentralized resource management for a distributed continuous media server

      Authors: C. Shahabi and F. Banaei-Kashani

      Publication: IEEE Transactions on Parallel and Distributed Systems (IEEE TPDS), Vol. 13, No. 7

      Category: Journal

      Download: PDF

  • 2001 - A framework for efficient and anonymous web usage mining based on client-side tracking

      Authors: Book Chapter: C. Shahabi and F. Banaei-Kashani

      Publication: Lecture Notes in Computer Science, Vol. 2356

      Category: Book Chapter

      Download: PDF

  • 2001 - A reliable, efficient, and scalable system for web usage data acquisition

      Authors: C. Shahabi, F. Banaei-Kashani and J. Faruque

      Publication: Workshop on Web Mining and Web Usage Analysis (WebKDD) in conjunction with KDD Conference

      Category: Workshop

      Download: PDF

  • 2001 - Feature Matrices: A model for efficient and anonymous web usage mining

      Authors: C. Shahabi, F. Banaei-Kashani, J. Faruque and A. Faisal

      Publication: ECWeb

      Category: Workshop

      Download: PDF

  • 2001 - Yoda: An accurate and scalable web-based recommendation system

      Authors: C. Shahabi, F. Banaei-Kashani, Y. Chen and D. McLeod

      Publication: CoopIS

      Category: Conference

      Download: PDF