Conversational Banking

For my thesis for the bachelor's degree User Experience Design (B. Sc.), I conducted an independent research on conversational user interfaces in the field of mobile banking. The results of this 5-month study yielded valuable insights concerning the design of CUIs in an environment that deals with sensitive data but also new opportunities for future work in this context. This case study breaks down months of research and is rather in-depth to provide context. This thesis was published as a research article at the Mensch und Computer conference 2021. You can find the paper here: //dl.acm.org/doi/10.1145/3473856.3473872.

Strategies:

Research, UX Design, Prototyping

Duration:

March – September 2020

Tools:

Sketch, Swift, IBM Watson Assistant

Status quo

With conversational interfaces, spoken or written language is used to execute tasks instead of clicking several buttons on a graphical interface. However, there are often inconsistencies between users’ expectations regarding the capabilities of CUIs and their real skills, leading to frustration and disappointment. Taking into account the current trend of applying CUIs, users will presumably encounter them more frequently. Yet, it seems like there is room for improvement concerning the design of these interfaces to provide a successful conversational user experience. These findings lead to the following research question for my thesis:

How might we enhance both user acceptance and user experience of conversational user interfaces?

Research

I undertook semi-structured interviews with six users to identify and understand their pain-points and needs concerning their previous interaction with conversational interfaces. Based on the gathered findings, I chose a potential use case for CUIs to sharpen the focus of my thesis.

"In my opinion, chatbots are not the solution for everything and have to be used purposefully, but if they are used purposefully and consciously, then they are extremely awesome"

– Statement from a participant of the preliminary study

The interviews indicated that users currently use CUIs for rather simple tasks as they cannot imagine conversational interfaces for more complex use cases yet. Still, the discussions about various use cases of chatbots have shown that a general interest in CUIs and their new areas of application exists. This became clear when existing solutions are complex or when using a conversational approach alleviates and supports users.

Themes found via thematic analysis
Themes identified from the user interviews in relation to conversational user interfaces via thematic analysis.

I also gathered the participants’ opinions about current applications of chatbots and what desires and rejections they have about 15 potential use cases for conversational user interfaces. For each topic, they were asked to rate their willingness to use such conversational agents from 1 (very unlikely) to 7 (very likely) and explain their decision in more detail.

All mean values for specific CUI use cases
All mean values for the presented 15 potential use cases for conversational interfaces.

I focus on trust to keep things concise. Respondents reported low trust in CUIs' competence and doubted the feasibility and convenience of chatbots. Their confidence is limited due to perceived risks and concerns about security, especially with sensitive data. Surprisingly, they do trust task-oriented dialog systems, valuing their usefulness and ease of use.

"If it is a small amount, then I would certainly use such a chatbot. When it comes to financial matters, I would rather confirm this myself by using a button or something like that"

– Participant 04

Next, I selected the use scenario for my thesis. Despite financial service chatbots scoring an average of 3.33, trust issues were frequently mentioned. From a UX perspective, this lack of trust is particularly intriguing and worth exploring.

"Especially when it is a financial matter, safety is extremely important. If that can be guaranteed, then [I think it is] great. But I don’t know how this trust can be evoked"

– Participant 03

What is conversational banking?

Conversational banking, the latest in digital banking, enables customers to use natural language via social media, messaging apps, and voice assistants to access financial services. This offers financial institutions a cost-effective way to deliver personalized digital experiences.

Younger generations expect personalized, flexible banking solutions similar to the apps they regularly use, and older customers are also dissatisfied with current mobile banking options. Fintech companies add pressure by offering innovative, user-friendly IT solutions that surpass traditional banking services. Conversational interfaces in mobile banking can enhance user experience and help banks address both technological and socio-economic challenges.

Ideation

I implemented three tasks for all CUIs to ensure data comparability, based on common banking app functions with varying complexity:

UC 01 – Retrieving account information

Check account balance and list recent transactions. No additional TAN is required after login, as banks are allowed to request a TAN every 90 days

Complexity: Low

UC 02 – Sending money

Transfer €20 to a saved contact. No additional TAN is requested, as contacts and their IBAN can be saved.

Complexity: Medium

UC 03 – Paying an invoice

Pay an invoice with provided details (recipient, amount, IBAN, reference). The amount exceeds 30 Euros and the IBAN is not saved, thus a TAN is required. To generate the TAN, the second factor of the SCA is required, which requests the users to do a finger scan.

Complexity: High

Next, I outlined the conversation flow for the three mobile banking use cases and visualized it to assess the task flow and identify terms users would naturally use.

First draft of the conversation
Outlining the conversational use flow for the three identifed use cases in conversational banking.

I tested the proposed dialog and use flow with users. Their feedback focused on the assistant's wording, which caused ambiguities and the conversational flow. Previously, the assistant asked for the recipient and amount simultaneously for invoice transfers, but users often responded to only one. This feedback was incorporated into the final prototypes.

Virtual assistant NEO that supports users doing their financial services
The virtual assistant NEO that supports users doing their financial services.

After refining the task flow, I adjusted the conversation design:

Anthropomorphic features

Human-like cues help users anticipate CUI behavior but often lead to overestimated capabilities and frustration. To manage expectations, NEO adopts a robotic look while maintaining social mannerisms like thanking and apologizing for errors.

Tone of the conversation

In financial services, the tone is goal-oriented but designed to feel collaborative. NEO asks clear questions to guide users toward their goals while remaining flexible in its responses.

Prototype

To address user security concerns and ensure realistic prototypes, I incorporated PSD2 policy elements, including Strong Customer Authentication (SCA). This requires two-factor authentication (2FA) for online payments, using at least two of the following:

  • Knowledge: Something the user knows (e.g. password or PIN)
  • Ownership: Something the user owns (e.g. smartphone or smartcard)
  • Inherence: Something unique to the user (e.g. fingerprint or face recognition)

General features

All CUIs handle the three use cases consistently, with NEO always acting the same way. While users control the sending or transferring of money, the interface implementation varies. Thus user control, highlighted as essential in the preliminary study, is maintained. Additionally, all CUIs provide feedback on the success of actions with relevant details.

General features every CUI has implemented
Login screen of ‘NEObanking’ and prompt to use Touch-ID for generating the required TAN.

Speech-based CUI

After login, the CUI introduces its capabilities and the virtual assistant through a welcome screen. The speech version allows users to control input by starting and stopping recordings. To prevent errors, users can always start a new recording, which deletes the previous one. Sensitive data, like account balances or transactions, is displayed visually, not read aloud, addressing concerns about social embarrassment and privacy from the preliminary study.

Screenshots of the voice-based CUI only allowing voice input
Screenshots of the speech-based CUI. Left: the home screen displaying the virtual assistant and its features. Center: displaying the interaction with the active record button. Right: visual summary of all gathered details by the assistant and confirmation button to execute the transaction.

Hybrid CUI

After login, users are taken to the home menu with buttons linking to various functions, each with a brief description. If the wrong function is selected, users can return to the main menu with the back button. This ensures easy access to the assistant's capabilities and allows users to restart or end interactions. A fixed UI element displays relevant details in the chat, improving context and system transparency.

Screenshots of the hybrid CUI using speech control and visual UI elements
Screenshots of the hybrid CUI. Left: the start screen stating all available functions and their descriptions. Center: displaying the interaction with the fixed context window. Right: displaying all gathered details by the assistant and activated button for the user to confirm the transaction.

Text-based CUI

The two prototypes were compared to this text-based chatbot version, often used on messaging platforms. Before transferring an amount, regardless if a TAN is required or not, the agent shows payment details and asks for confirmation. Users then initiate the payment with a text-based confirmation.

Screenshots of the text-based CUI only allwoing text input
Screenshots of the baseline CUI. Left: the start message stating all available functions. Center: displaying the interaction. Right: displaying all gathered details by the assistant and text-based confirmation by the user to confirm the transaction.

The following video shows the implemented use cases with the three different prototypes in action:

User Study

18 participants tested the three prototypes in a randomized task order, completing the same tasks with different CUIs to reduce carry-over effects. The study included several context-adapted questionnaires: technology acceptance (Davis), trust (Jian, Bisantz, and Drury), UEQ-s (Schrepp, Hinderks, and Thomaschewski), and perceived security (Cheng, Lam, and Yeung). Participants ranked the prototypes and participated in short interviews.

Description of the user study process
Procedure of the user study

Based on the literature and prototypes, I formulated the following hypotheses:

  • H1: Acceptance differs significantly between the prototypes
  • H2: Trust differs significantly between the prototypes
  • H3: User experience differs significantly between the prototypes

Results

The statistical evaluation found no significant differences between the prototypes, so the hypotheses were not supported. However, design implications for conversational banking interfaces can still be drawn from the data.

Acceptance:
In the interviews, 11 of 18 participants expressed interest in using conversational interfaces for financial services, noting they were "easy to use", "supportive", "responsive to personal needs" and "clearer" than current solutions. Five were open to using it as an add-on to existing apps, while two rejected the idea.

Trust:
The speech-based CUI scored higher than the hybrid one, despite initial concerns. Trust is complex and influenced by factors beyond design, like security. Users valued security more than speed, suggesting clear communication of security prompts could build trust.

User experience:
The voice-based CUI scored highest for both pragmatic and hedonic quality. Despite concerns about voice input and errors, many users found it "efficient", "pleasant", and "easy to use", indicating that its benefits outweighed the drawbacks.

Perceived security:
Perceived security was more influenced by the use case and security prompts than the interface design. A touch-ID prompt, for example, increased users' sense of security, though the most favored interface wasn't always seen as the most secure.

Conclusion

In summary, users have a positive attitude toward conversational interfaces for financial services, with a preference for the hybrid approach. Key considerations for improving user acceptance include:

  • 01 Support modality
    Users want the option to interact via both voice and text, allowing flexibility depending on their environment.
  • 02 Enhance transparecy
    Clearer feedback is needed to clarify processes, such as confirming whether payments have been successfully processed.
  • 03 Provide shortcuts:
    Text-based interfaces can feel tedious, so adding predefined buttons can make interactions faster and more efficient.

Additional features like invoice scanning, reminders for due invoices and receipt notifications were also suggested.

Learnings

01 Balancing user metrics in user studies

Finding suitable standardized questionnaires to gather quantitative data that also matched the context of my thesis was quite intricate. A balance of quantitative and qualitative metrics is essential for user studies, as they can provide further valuable insights.

02 Ensuring clarity and natural flow in conversations

I was previously only used to design graphical user interfaces. With designing for conversational interfaces I realized that it is essential to write down the intended conversation and evaluate it with users. This helps tremendously to see if the texts are comprehensible and sound natural and also prevent conversational dead ends.

03 Efficient prototypes with Swift and IBM Watson

My knowledge of the programming language Swift was limited and I never used IBM Watson Assistant before my thesis. However, I was able to efficiently implement convincing prototypes with both tools which probably would not have been possible with conventional prototyping tools.


That's the end of this project.
Wanna go to the next?