About

Hi — I'm Akshay, a Human-Centered AI researcher working at the intersection of multimodal interaction, intelligent interfaces, and usability engineering. My core question is deceptively simple: when people speak to, gesture at, or glance toward an AI system, does it actually understand them? And if not — why not, and how do we measure the gap?

My research builds and evaluates AI systems that integrate speech, gesture, and visual input as first-class modalities — not as afterthoughts bolted onto existing interfaces. I develop psychometric scales, run empirical studies in both controlled and real-world settings, and model the cognitive costs that accumulate when people must switch between interaction modes. This work has produced two validated evaluation instruments: a usability scale purpose-built for voice user interfaces, and a context-aware scale for multimodal automotive interfaces.

Over the past decade I have moved fluidly between academia and industry — stress-testing large language models and foundation models at Fraunhofer IAIS, building a standards-aligned voice evaluation lab at Deutsche Telekom, and conducting early HCI research at Fraunhofer IGD. That back-and-forth between rigorous evaluation and real deployment is the lens through which I approach every research question.

"In the world of voice interfaces, the best designs are those that don't just respond, but truly listen to the human heart."

— Akshay Deshmukh, Ph.D. Dissertation (2025)

A bit more about me

I was born and raised in the city of Bengaluru, India — the Garden City, where the air smells of jasmine and filter coffee, and where I spent my early years equally fascinated by how things worked and why people behaved the way they did around them. That curiosity never left.

Since then I have been, professionally speaking, something of a nomad — moving from India to Germany and staying put just long enough in each city to develop opinions about its public transport. Fun fact: if you plot the coordinates of every city I have lived in for an extended period on a globe, they trace a surprisingly coherent arc from the Indian subcontinent to central Europe.

Bengaluru Darmstadt Sankt Augustin Freiberg

Outside of research, I trek whenever mountains are within reasonable striking distance, grow cherry tomatoes on my terrace in Darmstadt (mangoes back home in Bengaluru, naturally), and have been known to represent my home state in table tennis — though I now primarily use that skill to win arguments about reaction time in HCI experiments.

I also practice yoga and meditation, which I find essential for anyone who spends their days thinking about cognitive load.

fun fact 01

My career trajectory follows a Fitts' Law curve — the further the target (tenure), the longer the movement time, but the larger the target gets as you approach it.

fun fact 02

I have collected more usability questionnaire responses than I have eaten dosas. Both numbers are in the thousands.


Currently

Scientific Employee / Assistant Professor at TU Bergakademie Freiberg, Germany (Nov 2024 – Present). I teach Human-Centered AI, Intelligent User Interfaces, and Multimodal Systems across B.Sc. and M.Sc. programmes, supervise theses on topics ranging from driver emotion detection to Human-in-the-Loop AI design, and pursue active research in ubiquitous computing and XR-based interaction.


Previously

As Senior HMI-UX Researcher at Fraunhofer IAIS (2022–2024), I led the human-centered evaluation of OpenGPT-X and the Foundation Model Playground — defining what "usable" means for large-scale AI systems when the users aren't engineers. Before that, as UX Research Engineer at Deutsche Telekom (2019–2022), I established a Voice Lab from the ground up, aligning evaluation protocols with ETSI and Alexa standards and applying mixed-method assessment to voice assistants deployed at scale.

I hold a Ph.D. in Human-Computer Interaction (2020–2025) from the Universitat Jaume I / Fraunhofer IAIS, and an M.Sc. in Distributed Software Systems from TU Darmstadt.

Research Areas

Multimodal AI Systems

Building and evaluating AI systems that treat speech, gesture, and visual input as equal, interchangeable modalities. A key focus is the Modality-Switch Cognitive Cost — quantifying what users actually lose when they are forced to change interaction channels mid-task.

Usability & UX Evaluation

Designing evaluation instruments that go beyond generic questionnaires. This includes the VUIQ (voice usability) and CAUS (context-aware automotive usability) — purpose-built, empirically validated scales for AI-driven interfaces where standard tools fall short.

Voice User Interfaces

Investigating how conversational AI performs across real users, real accents, and real conditions — not just clean speech in ideal environments. Research spans multilingual VUIs, speaker personalization, and deployment robustness in healthcare and smart-home contexts.

Human-AI Interaction

Designing AI systems that respond not just to what users say, but to how they feel. This includes affect-adaptive dialogue agents, Human-in-the-Loop (HITL) learning pipelines, and XR interfaces that adapt based on detected cognitive load and emotional state.

→ Looking to pursue a Master's thesis on multimodal interaction, VUI evaluation, or Human-AI collaboration? I supervise motivated students working on empirically grounded, publishable research. Reach out with a brief statement of interest.

Selected Projects
CAUS TOUCH VOICE GEST α = 0.91 · N=36 · 216 trials Voice > Gesture > Touch Context-Aware Usability Scale · AutoUI 2025
Usability in a moving vehicle isn't the same as usability on a desktop. CAUS is a 20-item psychometric scale purpose-built to evaluate multimodal interaction — touch, voice, and gesture — under real driving conditions. Validated through a controlled on-road study (N=36, 216 trials), CAUS achieved high internal consistency (α=0.91) and showed voice interaction significantly outperforms touch in high-complexity urban traffic.
Automotive HCIUsability ScaleMultimodalAutoUI 2025
😊 Happy · 76.7% 😢 Sad · 51.7% 😠 Angry · 63.9% EIVA RESPONSE Celebratory · "Yay! Amazing!" Motivational · "Keep it up!" Neutral · "That's great!" 90% prefer repair ✓ EIVA · N=30 · 1,620 utterances Alignment > Accuracy · BiLSTM + Attention Affect-Adaptive Voice Assistant · CUI 2026
What matters more — detecting your emotion correctly, or responding to it well? EIVA is a modular research platform built on a BiLSTM + attention speech emotion model (64.1% real-world accuracy) with expressive LLM-generated responses. A controlled study (N=30, 1,620 utterances) found expressive strategies matter far more than detection accuracy — and 90% of users want repair when the system gets it wrong.
Affective ComputingVoice AIEmotion RecognitionCUI 2026
VOICE +MSCC TOUCH +MSCC GESTURE cognitive load EEG MSCC — Event-Level Switch Cost Metric Quantifying the usability tax of modality switching Multimodal Interaction · MUM 2025
Every time a user switches from voice to touch — or touch to gesture — there is a measurable cognitive cost. MSCC is a novel event-level usability metric that quantifies exactly that cost, giving multimodal designers a first-class instrument to measure the hidden usability tax of switching. Rather than a design choice, modality switching is now a measurable, optimisable variable.
Multimodal HCICognitive LoadUsability MetricMUM 2025
▶ DEMO
Traditional usability evaluations are expensive, time-consuming, and mostly one-shot affairs. This M.Sc. thesis project (Fraunhofer IGD, TU Darmstadt) built a web-based platform that automates the full evaluation lifecycle — from questionnaire distribution to result analysis — making continuous, repeatable UX measurement possible within an agile development cycle. The tool integrates SUS, AttrakDiff, and INTUI scales and was validated on the Smarter Privacy project.
Usability AutomationUX ToolsHCIM.Sc. Thesis · 2018

→ Add screenshots or demo images to each card to bring these projects to life visually.

News
  • 2025Paper Towards Context-Aware Usability Assessments in Vehicles: CAUS Scale accepted at AutoUI 2025DOI
  • 2025Dissertation published: Measuring User Experience in Voice and Multimodal User InterfacesUniversitat Jaume I
  • 2025Paper Mind the Switch: Measuring Modality-Switch Cognitive Costs (MSCC) published in MUM '25DOI
  • 2025Paper AI-Based Framework for Usability in Digital Health accepted in Information Research
  • Nov 2024Joined TU Bergakademie Freiberg as Scientific Employee / Assistant Professor
  • 2024Paper User Experience and Usability of Voice User Interfaces: A Systematic Literature Review in InformationDOI
  • 2024Paper Validation of System Usability Scale as a Usability Metric to Evaluate Voice User Interfaces in PeerJ Computer Science
  • 2022–24Led user-centered design & AI evaluation for OpenGPT-X at Fraunhofer IAIS
  • 2019–22Established Voice Lab for AI system testing at Deutsche Telekom, aligned with ETSI and Alexa standards
Publications
2025
Mind the Switch: Measuring Modality-Switch Cognitive Costs (MSCC) as a Metric for Multimodal Interaction
Akshay Madhav Deshmukh, Bastian Pfleging
MUM '25 – 24th Intl. Conference on Mobile and Ubiquitous Multimedia→ DOI
Towards Context-Aware Usability Assessments in Vehicles: Developing the CAUS Scale for Multimodal Interfaces
Akshay Madhav Deshmukh
AutoUI 2025 – Adjunct Proceedings→ DOI
Measuring User Experience in Voice and Multimodal User Interfaces (Ph.D. Dissertation)
Akshay Madhav Deshmukh
Universitat Jaume I→ Repository
AI-Based Framework for Usability in Digital Health
Deshmukh, A. M.
Information Research Accepted
2024
User Experience and Usability of Voice User Interfaces: A Systematic Literature Review
Deshmukh, A. M., & Chalmeta, R.
Information, 15(9), 579→ DOI
Validation of System Usability Scale as a Usability Metric to Evaluate Voice User Interfaces
Deshmukh, A. M., & Chalmeta, R.
PeerJ Computer Science, 10, e1918
Under Review / Forthcoming
CAUS: A Context-Aware Usability Scale for Evaluating Multimodal Interaction in Driving
AutoUI 2026 Under Review
Quantifying Modality-Switch Cognitive Cost: An Event-Level Usability Metric for Multimodal Interaction
ICMI 2026 Under Review
From Detection to Alignment: Evaluating Affect-Adaptive Response Strategies in Voice-Only Conversational Agents
CUI 2026, Bremen Under Review
Development and Validation of a User Experience Evaluation Questionnaire for Voice User Interfaces
Under Review
Comparative Usability Evaluation of Voice and App-Based Interaction Modalities in Smart Home AI Assistants
Under Review
Experience
Scientific Employee / Assistant Professor
Nov 2024 – Present
Teaching Human-Centered AI, Intelligent User Interfaces, and Multimodal Systems to the next generation of computing researchers. Supervising M.Sc. theses on topics I genuinely find fascinating — from real-time driver emotion detection to Human-in-the-Loop AI design systems. Alongside teaching, I pursue active research in ubiquitous computing, XR interaction, and emergency response interfaces (Feuerwehr-App, Saxony).
Senior HMI-UX Researcher
Nov 2022 – Oct 2024
Fraunhofer IAIS, Sankt Augustin, Germany
Led the human-centered evaluation programme for OpenGPT-X, the Foundation Model Playground, and SPEAKER — three of Germany's flagship AI research initiatives. The central challenge: defining what "good" looks like for AI systems when the end users aren't AI researchers. Built evaluation frameworks measuring usability, interaction coherence, cognitive load, and real-world robustness across multimodal deployment contexts.
UX Research Engineer
Feb 2019 – Oct 2022
Deutsche Telekom (via axxessio GmbH)
Built Deutsche Telekom's Voice Lab from the ground up — establishing evaluation standards aligned with ETSI specifications and Alexa certification requirements. Ran systematic usability studies for the Magenta Smart Speaker and TelekomBot, designed end-to-end conversational AI assessment protocols covering ASR accuracy, NLU understanding, and holistic interaction quality, and applied mixed-method evaluation to improve robustness in healthcare and smart-home deployments.
Research Assistant / Directed Researcher
Dec 2015 – Feb 2018
Fraunhofer IGD, Darmstadt, Germany
Conducted HCI research across the full spectrum of usability evaluation methods — expert review, participatory study, quantitative survey — and synthesised findings into an automated evaluation analysis tool that tracked usability progress across the software development lifecycle. Applied across four projects: Trend Analysis, FUPOL, Process Adaptation, and Smarter Privacy.
Visiting Researcher
Jun 2011 – Jul 2014
Indian Institute of Science (IISc), Bangalore, India
Early-career research on inclusive interface design for India's extraordinarily diverse population — users spanning multiple languages, literacy levels, and degrees of digital experience. This work shaped a lasting commitment to designing technology that works for people beyond the "average user" assumption.
Teaching & Supervision

I teach research-driven, project-oriented courses at TU Bergakademie Freiberg, with a focus on getting students building and evaluating real systems — not just reading about them.

  • Interactive Ubiquitous Systems & Intelligent User Interfaces (full course — B.Sc./M.Sc.)
  • User-Centered Design (full course — B.Sc./M.Sc.)
  • Data Structures in C (lectures, exercises & practicals)

Thesis Topics Supervised

  • Evaluating UX in Multimodal Voice Interfaces — studying how combining voice with touch reshapes the interaction experience (M.Sc.)
  • Personalization in Voice User Interfaces — measuring the real impact of tailored responses on user satisfaction and engagement (B.Sc.)
  • Designing VUIs for Non-Native German Speakers — addressing usability and comprehension gaps often overlooked in mainstream VUI design (M.Sc.)
  • Driver Emotion Detection via In-Cabin Sensors — building an AI system that adapts the vehicle interface in real time based on detected stress or fatigue (M.Sc., Ongoing)
  • Human-in-the-Loop Learning for AI-Powered Design Systems — exploring how structured human feedback improves generative design quality and usability (M.Sc., Ongoing)

Grants & Collaborations

  • DAAD-DST-PPP — International research collaboration between BITS Pilani (India) and TUBAF (Germany) on emotionally intelligent voice assistants
  • ESF-Nachwuchsforschergruppen — SharedBots: designing and evaluating multi-user autonomous robotics for public and semi-public spaces
Contact

Whether you are a prospective student, a researcher interested in collaboration, or just want to say hello — feel free to reach out.