Mehbubul Hasan Al-Quvi

Systems Researcher | Prospective PhD Student

Email: quvi007 [at] gmail [dot] com

Curriculum vitae

About Me

Hi, Quvi here. I am a systems research intern at OrderLab, University of Michigan, Ann Arbor. My research interests lie in the intersection of Operating Systems, Storage Systems, Computer Architecture, Compilers, and System Security. My current research focuses on reliable kernel extensibility, and binary lifting, recompilation frameworks for security. I am a recent graduate from the Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology (BUET).

Currently, I am looking for a Ph.D. position in OS/Architecture/Compilers/Security.

Research Papers

Yiming Xiang, Wanning He, Mehbubul Hasan Al-Quvi, and Peng Huang

Submitted to 19th USENIX Symposium on Operating Systems Design and Implementation (OSDI 2025) - CORE Rank A*

A Framework for Adding Runtime-Checks in eBPF Programs


ePass

Mehbubul Hasan Al-Quvi, Shamit Fatin, Sukarna Barua, and Anindya Iqbal

Under evaluation at Samsung Research Bangladesh (SRBD)

LELANTE: LEveraging LLM for Automated ANdroid Test Execution


LELANTE_EXEC

⮞ Details

Automated mobile application testing faces two critical challenges: (1) executing natural language test specifications without manual scripting, and (2) reliably verifying test outcomes across diverse application states. Existing approaches often require specialized scripting languages or rely on fragile record-and-replay mechanisms, which increase maintenance costs and reduce verification reliability. To address these issues, we introduce LELANTE, a novel framework that leverages large language models (LLMs) to automate both execution and verification of natural language test cases. LELANTE employs a two-stage pipeline: an LLM interprets and executes test steps via interaction with the device's graphical user interface (GUI), while another language model validates execution outcomes against expected conditions. In experiments across 390 test cases spanning 10 popular Android applications, LELANTE achieved an 85% execution success rate and an 85% verification accuracy using open-source models on standard hardware. These results show that LLMs can effectively bridge the gap between natural language test specifications and automated test execution, making mobile testing more accessible without compromising reliability.

H.A.Z. Sameen Shahgir, Mehbubul Hasan Al-Quvi, Shamit Fatin, Sukarna Barua, and Anindya Iqbal

Under preparation for submission to Autonomous Agents and Multi-Agent Systems (AAMAS 2025)

DistilAgent: Distilling Agentic Capabilities from Large Language Models for Long-Horizon UI Navigation


DistilAgent

⮞ Details

Large Language Models (LLMs) have shown remarkable potential as autonomous agents in diverse environments, but their agentic capabilities are strongly contingent on their parameter count and context window size. While closed-source LLMs such as GPT-4 and Claude are already quite capable, smaller open-source models struggle with autonomous UI navigation tasks. To bridge this gap, we propose DistilAgent, a novel framework that distills the capabilities of long-context closed-source LLMs into smaller models , significantly improving their performance in autonomous UI navigation. DistilAgent incorporates three key components: task discretization to break down long-horizon tasks into smaller sub-tasks that fit within the context window of smaller models, explicit reasoning enforcement to ensure the LLM reasons about grounding and the current state of the environment, and exploration encouragement to prompt the LLM to explore the UI when required actions are not immediately available on the current screen. Our results demonstrate that DistilAgent improves the task completion rate of GPT4 from 50% to 90%. We collected AutoNav1K - a synthetic dataset of 1,000 GPT4 task executions using DistilAgent. After finetuning on AutoNav1K, the task completion rate of LLaMA-3.1-8B improved from 0% to 60%. DistilAgent showcases the potential for distilling the agentic capabilities of large, closed-source LLMs into smaller models, democratizing access to high-performance autonomous UI navigation.

Experience

Research Intern

June 2024 - Present (Remote)

https://orderlab.io

Systems Researcher

January 2025 - Present

https://cse.buet.ac.bd

Advised by Prof. Anindya Iqbal

EDGE Research Laboratory (EdgeLab@BUET)

Department of Computer Science and Engineering, BUET

Research Assistant

February 2024 - December 2024

https://cse.buet.ac.bd

Advised by Professor Anindya Iqbal and Professor Sukarna Barua

SAMSUNG Applied Machine Learning Research Laboratory (SRBDLab@BUET)

Room 901, ECE Building, BUET

Education

Bachelor of Science

April 2019 - July 2024

Computer Science and Engineering

Bangladesh University of Engineering and Technology (BUET)

Thesis: Analyzing Large Language Models in Low-Resource Settings advised by Professor Muhammad Abdullah Adnan

Grants and Fellowships

  • Selected for RISE Research Grant 2023-24
  • Selected for Deans’ List Award 2019-24
  • Selected for Merit Scholarship 2019-24
  • Selected for Admission Test Scholarship 2019
  • Selected for Technical Scholarship 2019-24

Awards and Honors

  • Runner Up Student Poster Award - RISE Research Grant, 2023
  • 2nd Placed Team (BUET TLE) - Intra-BUET Programming Contest, 2019

News

  • [Jan 2025] Started as a Systems Researcher at EdgeLab, BUET.
  • [Dec 2024] Left Samsung Applied Machine Learning Research Laboratory, BUET.
  • [Jun 2024] Started as a Remote Research Intern at OrderLab, U-M.
  • [Jun 2024] Defended my Bachelor’s thesis. Thanks to my supervisor.

A Little More About Me

In my spare time, I love exploring new frameworks, programming languages, and Linux distributions. I enjoy building small apps, software, and scripts for personal use.

I became a self-taught programmer during my teenage years, driven by curiosity and a passion for learning. My coding journey began at 13 when I ran my first C program. Back in 2012, as a 7th grader, I was fascinated by web development—creating websites with HTML, CSS, JavaScript, jQuery, and PHP. That same year, I taught myself C programming, followed by JavaScript, Java, Android Development, Visual Basic.NET, C#, SQL, and more. I even experimented with game development in Unity but quickly lost interest in it, and shifted my focus to Security and Penetration Testing.

During high school, two of my most significant projects were a Remote Administration Tool and an Android app for emergency situations: EmerApp. My passion for problem-solving and software development continued to grow, and in 2018, I secured the 8th position in the BUET admission test (out of 10000 candidates).

C/C++ remains my most favorite programming language for building system software. For web development, I prefer Javascript, and I enjoy using it for both frontend development and Node.js applications. Currently, I’m working on a hobby project, Project Algorist, which aims to make learning data structures and algorithms more accessible for the newcomer community—completely free. I actively solve problems on AtCoder, store my projects on GitHub, and love reading e-books from The Pragmatic Programmers.

My other hobbies are reading books, travelling, watching movies, and playing video games. I’m a huge fan of the DC Comics and the Mr. Robot series. I also enjoy playing games like GTA San Andreas, GTA V, and Call of Duty. I’m a big fan of the Call of Duty series and have played every game in the franchise.

Here is my childhood website (one of the dearest memories I have): Quvi’s Website

I’m always open to new opportunities, collaborations, and discussions. Feel free to reach out to me via email or LinkedIn.

Thanks to my visitors worldwide. Here is a live visitor map of my website: