MBZUAI Institutional Repository

Recent Submissions

  • Item
    Beamwidth-Adaptive ISAC Beamforming: A Joint Optimization Framework for Detection and Communication
    (IEEE, 2025-09-11) Fu, Zengchang; Yuan, Jide; Yang, Yuli; Guizani, Mohsen
    In this paper, we study sensing beamwidth-adaptive beamforming in an integrated sensing and communication (ISAC) system, aiming to illuminate the communication user so as to assist in detecting adjacent targets. To this end, we consider the beamwidth variability of the sensing beam as a key concept of the beamformer. The ratio of the beampattern gain summed over the desired angular coverage to that summed over the entire angular range, namely the effective beampattern gain ratio (EBGR), is adopted as the objective characterizing the sensing performance. The proposed framework takes the communication signal-to-interference-plus-noise ratio (SINR), the total transmit power limit, and beampattern matching into account. To address the intrinsic non-convexity introduced by the EBGR and the SINR, we employ Dinkelbach's method to transform the fractional objective, and obtain a high-quality beamforming solution by applying a semidefinite relaxation (SDR)-based approach. A regularized zero-forcing (RZF)-based approach is further proposed to reduce complexity. Numerical results validate the effectiveness of our beamwidth-adaptive design in various scenarios. (See the Dinkelbach sketch after this list.)
  • Item
    Easz: An Agile Transformer-based Image Compression Framework for Resource-constrained IoTs
    (IEEE, 2025-09-15) Mao, Yu; Li, Jingzong; Wang, Jun; Xu, Hong; Kuo, Tei-Wei; Guan, Nan; Xue, Chun Jason
    Neural image compression, necessary in various machine-to-machine communication scenarios, suffers from heavy encode-decode structures and inflexibility in switching between different compression levels. Consequently, it is challenging to apply neural image compression, developed for powerful servers with high computational and storage capacities, to edge devices. We take a step toward solving these challenges by proposing a new transformer-based, edge-compute-free image coding framework called Easz. Easz shifts the computational overhead to the server, and hence avoids the heavy encoding and model-switching overhead on the edge. Easz utilizes a patch-erase algorithm to selectively remove image contents using a conditional uniform-based sampler; the erased pixels are reconstructed on the receiver side through a transformer-based framework. To further reduce the computational overhead on the receiver side, we introduce a lightweight transformer-based reconstruction structure. Extensive evaluations conducted on a real-world testbed demonstrate multiple advantages of Easz over existing compression approaches in terms of adaptability to different compression levels, computational efficiency, and image reconstruction quality. (See the patch-erase sketch after this list.)
  • Item
    Optimising Vision Transformer Performance on Limited Datasets: A Multi-Gradient Approach
    (IEEE, 2025-09-15) Ali, Mohsin; Raza, Haider; Gan, John Q.; Khan, Muhammad Haris
    Vision Transformers (ViTs) are well known for capturing the global context of images using Multi-head Self-Attention (MHSA). However, compared to Convolutional Neural Networks (CNNs), ViTs typically exhibit a weaker inductive bias and require a larger volume of training image data to learn local feature representations. While various methods, such as the integration of CNN features or advanced pre-training strategies, have been proposed to introduce this inductive bias, they often require significant architectural modifications or rely heavily on expansive pre-training datasets. This paper introduces a novel approach for training ViTs on limited datasets without altering the ViT architecture. We propose the Multi-Gradient Image Transformer (MGiT), which utilizes a parallel training method with a compact auxiliary ViT to adaptively optimize the weights of the target ViT. This approach yields significant performance improvements across diverse datasets and training scenarios. Our findings demonstrate that MGiT enhances ViT efficiency more effectively than traditional training methods. Furthermore, the application of Jensen-Shannon (JS) divergence validates the convergence and alignment of feature understanding between the primary and auxiliary ViTs, thereby stabilizing the training process. (See the JS-divergence sketch after this list.) The code is available at https://github.com/game-sys/Multi-Gradient-Image-Transformer-MGiT-
  • Item
    Pureformer: Transformer-Based Image Denoising
    (IEEE, 2025-09-15) Gautam, Arnim; Pawar, Aditi; Joshi, Aishwarya; Tazi, Satyanarayan Narayan; Chaudhary, Sachin; Dudhane, Akshay A.; Vipparthi, Santosh Kumar; Murala, Subrahmanyam
    Image denoising is a crucial task in computer vision, with applications in real-world smartphone image processing, remote sensing, and photography. Traditional convolutional neural networks (CNNs) often struggle to reduce noise while preserving fine details due to their limited receptive fields. Transformer-based approaches, such as Restormer, improve long-range feature modeling, while PromptIR enhances local feature refinement. However, existing methods still face challenges in effectively integrating multi-scale features for robust noise reduction. We propose Pureformer, a Transformer-based encoder-decoder architecture specifically designed for image denoising. The model employs a four-level encoder-decoder structure, where each stage utilizes Multi-Dconv Head Transposed Attention (MDTA) and a Gated-Dconv Feed-Forward Network (GDFN) to extract and refine multi-scale features. We also propose a feature enhancer block in the latent space that expands the receptive field using a spatial filter bank, improving feature fusion and texture restoration. Skip connections between the encoder and decoder help retain spatial information, ensuring high-fidelity reconstruction. Pureformer is evaluated on the NTIRE 2025 Image Denoising Challenge dataset, achieving a test PSNR of 29.64 dB and an SSIM of 0.8601. We also validate Pureformer on the existing benchmark datasets BSD68 and Urban100. The results demonstrate that Pureformer surpasses existing methods in both noise reduction and detail preservation, making it a strong choice for real-world image denoising. (See the MDTA sketch after this list.) Access our codes and models at https://github.com/Chapstick53/NTIRE2025_cipher_vision.
  • Item
    FusedVision: A Knowledge-Infusing Approach for Practical Anomaly Detection in Real-World Surveillance Videos
    (IEEE, 2025-09-15) Dawoud, Khaled; Zaheer, Muhammad Zaigham; Khan, Mustaqeem; Nandakumar, Karthik; Saddik, Abdulmotaleb El; Khan, Muhammad Haris
    Object-centric approaches have gained attention as effective one-class classification methods for detecting anomalies in videos. These approaches rely on an object detector to isolate all objects in the frames and subsequently leverage either the objects themselves or their interactions to train a learning system. In this study, we put forth a novel perspective on anomaly detection by proposing a branched network architecture that employs both an object detector and a normalcy learning model, working in tandem to identify anomalies within the data more effectively. Through extensive experimentation, we analyze the optimal fusion mechanism as well as the anomaly scoring proposed in our branched approach. Our approach is more practical for real-world applications of anomaly detection, where infusing knowledge about anticipated anomalies can improve performance while nonetheless maintaining baseline performance. To evaluate the general applicability of our approach, we integrate it with multiple recent anomaly detection methods and assess its efficacy on three widely used anomaly detection datasets: ShanghaiTech, Avenue, and Ped2. Our proposed approach noticeably outperforms existing methods, demonstrating its effectiveness in detecting anomalies across a range of contexts. (See the score-fusion sketch after this list.) The implementation of our method is available at https://github.com/kdawoud91/FusedVision.
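
Sketch for "Beamwidth-Adaptive ISAC Beamforming": the abstract names Dinkelbach's method for the fractional EBGR objective but does not give the exact subproblem. The toy below is a minimal, self-contained illustration of the Dinkelbach iteration that maximizes a ratio of quadratic forms over unit-norm beamformers; the matrices A and B, standing in for gain accumulated over the desired and the entire angular coverage, are our assumption, and each parametric subproblem is solved by an eigenvector computation rather than the paper's constrained SDR.

```python
import numpy as np

def dinkelbach_ratio_max(A, B, tol=1e-9, max_iter=100):
    """Maximize (w^T A w) / (w^T B w) over unit-norm w via Dinkelbach's method.

    A and B are hypothetical stand-ins for beampattern gain accumulated
    over the desired and the entire angular coverage (the EBGR's numerator
    and denominator). Each parametric subproblem
        max_w  w^T (A - lam * B) w   s.t.  ||w|| = 1
    is solved exactly by the leading eigenvector; the paper instead uses
    an SDR-based approach for its SINR/power-constrained problem.
    """
    n = A.shape[0]
    w = np.ones(n) / np.sqrt(n)              # feasible starting beamformer
    lam = (w @ A @ w) / (w @ B @ w)          # current ratio value
    for _ in range(max_iter):
        _, vecs = np.linalg.eigh(A - lam * B)
        w = vecs[:, -1]                      # maximizer of the subproblem
        f = w @ A @ w - lam * (w @ B @ w)    # F(lam); zero at the optimum
        lam = (w @ A @ w) / (w @ B @ w)      # Dinkelbach update of lam
        if abs(f) < tol:
            break
    return w, lam

# Toy example with random positive semidefinite matrices
rng = np.random.default_rng(0)
M = rng.standard_normal((8, 8)); A = M @ M.T
M = rng.standard_normal((8, 8)); B = M @ M.T + 8 * np.eye(8)
w_opt, ratio = dinkelbach_ratio_max(A, B)
print("optimal gain ratio:", ratio)
```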
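
Sketch for "Easz": the patch-erase algorithm uses a conditional uniform-based sampler whose exact conditioning is not given in the abstract, so the sketch below simply erases a uniformly sampled subset of non-overlapping patches on the edge device and returns the mask a server-side transformer would need for reconstruction. The erase policy and all names are assumptions.

```python
import numpy as np

def patch_erase(img, patch=16, keep_ratio=0.5, rng=None):
    """Hypothetical rendering of Easz's edge-side patch-erase step: split
    the image into non-overlapping patches and zero out a uniformly
    sampled subset; the erased pixels would be reconstructed server-side
    by the transformer-based framework."""
    rng = rng or np.random.default_rng()
    h, w = img.shape[:2]
    gh, gw = h // patch, w // patch          # patch-grid dimensions
    n = gh * gw
    keep = rng.choice(n, size=int(keep_ratio * n), replace=False)
    mask = np.zeros(n, dtype=bool)
    mask[keep] = True                        # True = patch is transmitted
    out = img.copy()
    for idx in np.flatnonzero(~mask):        # erase the remaining patches
        r, c = divmod(idx, gw)
        out[r * patch:(r + 1) * patch, c * patch:(c + 1) * patch] = 0
    return out, mask.reshape(gh, gw)

img = np.random.randint(0, 256, (64, 64, 3), dtype=np.uint8)
erased, mask = patch_erase(img, patch=16, keep_ratio=0.5)
```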
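
Sketch for "MGiT": the abstract uses Jensen-Shannon divergence to validate alignment between the target and auxiliary ViTs. Below is a standard JS-divergence computation between the two models' predictive distributions in PyTorch; how MGiT wires this into its multi-gradient update is not specified in the abstract.

```python
import torch
import torch.nn.functional as F

def js_divergence(logits_p, logits_q):
    """JS divergence between the predictive distributions of the target
    and auxiliary ViTs, averaged over the batch."""
    p, q = F.softmax(logits_p, dim=-1), F.softmax(logits_q, dim=-1)
    m = 0.5 * (p + q)
    kl = lambda a, b: (a * (a.clamp_min(1e-12).log()
                            - b.clamp_min(1e-12).log())).sum(-1)
    return (0.5 * kl(p, m) + 0.5 * kl(q, m)).mean()

logits_target = torch.randn(4, 10)  # class logits from the target ViT
logits_aux = torch.randn(4, 10)     # class logits from the auxiliary ViT
print(js_divergence(logits_target, logits_aux))  # low value = aligned
```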
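
Sketch for "Pureformer": a minimal single-head PyTorch rendering of Multi-Dconv Head Transposed Attention (MDTA), the attention block named in the abstract (originally from Restormer). Attention is computed across channels rather than spatial positions, so the cost grows linearly with resolution; the multi-head version, the GDFN, and Pureformer's actual hyperparameters are omitted here.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class MDTA(nn.Module):
    """Single-head sketch of Multi-Dconv Head Transposed Attention:
    1x1 then depthwise 3x3 convolutions produce q, k, v, and attention
    is taken across channels (a c-by-c map)."""
    def __init__(self, dim):
        super().__init__()
        self.qkv = nn.Conv2d(dim, dim * 3, kernel_size=1)
        self.dwconv = nn.Conv2d(dim * 3, dim * 3, kernel_size=3,
                                padding=1, groups=dim * 3)
        self.temperature = nn.Parameter(torch.ones(1))
        self.proj = nn.Conv2d(dim, dim, kernel_size=1)

    def forward(self, x):
        b, c, h, w = x.shape
        q, k, v = self.dwconv(self.qkv(x)).chunk(3, dim=1)
        q = F.normalize(q.flatten(2), dim=-1)                # (b, c, h*w)
        k = F.normalize(k.flatten(2), dim=-1)
        attn = (q @ k.transpose(-2, -1)) * self.temperature  # (b, c, c)
        out = attn.softmax(dim=-1) @ v.flatten(2)            # mix channels
        return self.proj(out.reshape(b, c, h, w)) + x        # residual

x = torch.randn(1, 32, 64, 64)
print(MDTA(32)(x).shape)  # torch.Size([1, 32, 64, 64])
```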
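
Sketch for "FusedVision": the paper experimentally searches for the optimal fusion of its two branches. As one illustrative candidate (an assumption, not the paper's chosen mechanism), the sketch below min-max normalizes the per-frame scores of the normalcy branch and the detector branch and blends them with a convex weight alpha.

```python
import numpy as np

def fused_anomaly_score(s_normalcy, s_detector, alpha=0.5):
    """One candidate fusion: min-max normalize each branch's per-frame
    scores and blend with a convex weight. s_normalcy comes from the
    one-class normalcy model, s_detector from the object-detector branch
    carrying knowledge about anticipated anomalies."""
    norm = lambda s: (s - s.min()) / (s.max() - s.min() + 1e-8)
    return alpha * norm(s_normalcy) + (1 - alpha) * norm(s_detector)

# toy per-frame scores for a five-frame clip
s_n = np.array([0.10, 0.20, 0.90, 0.80, 0.15])
s_d = np.array([0.00, 0.10, 0.70, 0.95, 0.05])
print(fused_anomaly_score(s_n, s_d))
```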
