Chapter 3: Conceptualizing Interaction

Search this chapter

Audio Overview

0:00 / 0:00

Autoplay next chapter

Welcome to Last Minute Lecture.

This free chapter overview is designed to help students review and understand key concepts.

These summaries supplement not replaced the original textbook and may not be redistributed or resold.

For complete coverage, always consult the official text.

Welcome to the Deep Dive.

Today we are undertaking a truly essential mission,

conceptualizing interaction.

We're going straight to the very foundations of design to understand how you define a digital product for where a single line of code is written or a single pixel is placed.

That's right.

And for anyone trying to get their head around the mechanics of interaction design, this deep dive is, well, it's your shortcut.

We're going to unpack the key strategies you need to move from, you know,

a vague idea to a concrete testable blueprint.

And what does that include?

Well, it includes defining the conceptual model, using powerful interface metaphors, outlining the core ways users can engage, and of course, recognizing the massive theoretical influences, the paradigms and visions that shape the entire field.

And to kick things off, let's start with a classic scenario that illustrates exactly why this conceptual work is so necessary.

Okay.

The voice assisted robot waiter in a restaurant.

Yes.

The initial enthusiasm for this idea is always based on what we call cutative benefits.

The designer gets really excited about the potential.

Like, it can take orders, it can entertain customers with witty banter, it can memorize everyone's preferences.

We fall in love with the tech, right?

Before we really understand the task.

Exactly.

The robot is a solution looking for a problem.

And the core lesson here is shifting that focus.

You have to ask, what is the actual problem we are trying to solve?

Oh, it's not about the cool robot.

It's not.

Maybe the restaurant owner's actual problem isn't entertaining customers.

Maybe the real constraint is that it's just incredibly difficult to recruit and retain good waitstaff.

And that one reframing changes everything.

The moment you define the actual problem, say staffing difficulty,

your design effort becomes focused.

It immediately forces these crucial research questions.

Right.

Like how intelligent does this robot really need to be to solve a staffing gap?

Does it need to move smoothly or is rolling around okay?

And maybe most importantly, will customers actually like it?

Or will they find it gimmicky and distracting after the first week?

Precisely.

It turns that kind of blue sky thinking into actionable research.

It makes sure you're spending your resources solving the right challenges, not just implementing the flashiest new thing.

Okay, so let's unpack this.

The first analytical step in conceptualizing any interaction is the discipline of writing down your assumptions versus your claims.

This step is so fundamental.

An assumption is something you're taking for granted, which means you absolutely must investigate it.

So for example, for example, assuming that because everyone has a smartphone, they automatically want a super complex entertainment system built into their car.

That's a huge assumption.

And a claim, on the other hand, is a specific assertion of truth that is, you know, open to debate or requires proof.

Right.

You might claim that a multimodal interaction style where a user speaks commands or gestures while driving is perfectly safe.

That's a bold claim.

A very bold claim.

It needs rigorous testing.

We can see how this works in this hypothetical scenario from the source text about a web browser redesign team.

They're trying to

Right.

One designer might claim the bookmark function is just error prone.

It's too hard to organize.

But then a software engineer weighs in with a conflicting claim.

Bookmarking is obsolete.

People just use their history list now.

It's quicker.

So they're at a stalemate.

Exactly.

But the beauty of articulating these conflicting assumptions about user behavior is that it forces the team to take a step back.

They realize their disagreement is all about user intent.

So the process isn't about deciding who's right.

No, it's about forming a concrete testable investigation.

Like, can we simplify the entire process of saving, ordering, and retrieving websites on mobile browser?

That question defines the next stage of design.

And the danger of skipping this process.

I mean, the danger of running on faulty assumptions is measured in billions of dollars of failed products.

Let's look at the classic examples.

3D TV and curves TV.

Oh, the failure of 3D TV in the home market is a textbook case of faulty assumptions.

The assumption was that the amazing floorback for movies like Avatar in the cinema would translate directly to the living room.

And that people wouldn't mind wearing special glasses or paying more.

The claim was this incredible enhanced viewing immersion.

But the usability problems were just overwhelming.

It wasn't just the annoyance of the glasses.

It was the the social friction.

What do you mean by that?

The moment you put on glasses, you stop being able to see or interact naturally with the person next to you on the couch.

Right.

You can't multitask.

You can't.

People complained of motion sickness and you certainly couldn't glance down at your phone easily.

That social and physical friction, it just destroyed the concept.

And curved TV.

Curved TV failed because the assumed benefits were trivial compared to the cost.

The conceptual model that a curved screen somehow provides a better field of view for the living room, it just didn't deliver enough value.

Faulty assumptions in birth cases proved fatal.

This is exactly why forming the conceptual model early is so important.

It's the solution to those faulty assumptions.

And it delivers three massive benefits for any design team.

Orientation.

It lets the team ask precise specific questions instead of vague ones.

And open mindedness.

Yes.

It encourages exploration of diverse ideas before you commit to one expensive approach.

And crucially, common ground.

It establishes a shared language for the team, creating a testable blueprint everyone understands.

And that blueprint, the conceptual model, is just a high level description of what people can do with product and the concepts they need to understand it.

It's the simple story we tell the user about how this thing works.

And what are its core components?

They're few, but powerful.

One, metaphors and analogies linking to familiar concepts.

Two, concepts.

So the objects, their attributes, and the operations you can do like saving or printing.

Okay.

Three,

relationships like containment of folder holds a file.

And four, mappings to the user experience.

The goal here is just relentless simplicity and memorability.

The best conceptual models literally transform activities.

When you think about the classics, the digital spreadsheet, the worldwide web, and of course, the desktop metaphor.

The desktop pioneered by the Xerox star in 1981 was so revolutionary because it was based on concepts everyone already understood from the office.

Paper, folders, mailboxes.

Or filing cabinets.

It was the great democratization of computing.

Before that, you needed to master the command line.

You had to speak in riddles to the machine.

The desktop model suddenly made computing visually accessible and intuitive.

So metaphors are really central to the whole system.

Absolutely.

They provide a structure similar to something you already know, but with its own digital properties.

A search engine is a perfect example.

It invites a comparison to a massive mechanical engine, helping users connect their need to find things with this complex algorithmic function.

And metaphors also manage our expectations.

They do.

Take the recycle bin or trash can on the desktop.

Logically, if you throw something away, it should go under the desk, right?

Not sit prominently on the desk.

That's a good point.

But it succeeded because its visibility provided a safety net.

It let users easily retrieve deleted items.

We accepted that logical inconsistency for the benefit of visibility.

What's fascinating now is watching how these metaphors are evolving, especially with mobile devices.

The shift to the contemporary card metaphor is huge.

The card used heavily in social media feeds and by services like Google Now is brilliant because it has these familiar paper -based associations.

It structures content into

chunked sections, which is perfect for rapid scanning on a small screen.

And looking to the future, there's a suggestion to frame design goals using two conceptual metaphors, the glasses versus the tool.

Right.

And this distinction is incredibly insightful for thinking about how we interact with AI.

So what's the difference?

The glasses metaphor means the technology amplifies human cognition implicitly.

You put on the glasses, and the enhancement is almost invisible or subconscious.

The system tries to disappear.

But the tool metaphor, like binoculars or a voice recorder, requires explicit conscious interaction.

Exactly.

You pick it up, you aim it, you operate the controls, and then you put it down.

So if you're building a new AI assistant, is it trying to be invisible glasses subtly helping your daily tasks, or is it a tool demanding you consciously operate it every time?

That one decision changes the entire interface design.

Which brings us to the actual mechanics of engagement, the five core interaction types.

These define how a person engages with any system.

They are instructing, conversing, manipulating,

exploring, and responding.

The first and oldest is instructing.

This is where you tell the system what to do, typing commands, selecting menus, pressing buttons.

It's quick, efficient, and great for repetitive tasks like saving a file or deleting an email.

And you can see the trade -off in instructing when you look at vending machines.

How so?

Well, the simple soda machine is pure instruction.

Press one button, get one product, super efficient.

But the snack machine with dozens of options forces a more complex interaction.

Right, you have to read a code like C7, key it into a keypad, check the price.

More options always mean more complexity and a higher chance of error.

The second type is conversing, a two -way dialogue where the system tries to act like a partner.

It's based on our most familiar human interaction, like asking Siri, do I need an umbrella today?

And having it interpret your implicit intent, the weather forecast, and respond naturally.

But it can feel natural, but it can also be incredibly cumbersome.

Oh, for sure.

Think about those auditory phone menus where you have to listen to a slow, endless list of options before you can finally do what you want.

The naturalness is completely undermined by the system's rigid structure.

Exactly.

Our third type is manipulating,

interacting with virtual objects by leveraging our physical knowledge moving things, zooming in, dragging a window.

This pulls heavily from the principles of direct manipulation.

Which were formalized by Ben Schneiderman back in 1983.

Right, and direct manipulation promotes rapid learning and a feeling of mastery because of its three core ideas, continuous visual representation of objects, rapid reversible actions with immediate feedback, and using physical actions like a drag and drop instead of complex commands.

But manipulation isn't always the best approach.

Definitely not.

While it's perfect for dragging a file to a folder, imagine you're writing a 100 page report and you realize you've misspelled a name like Schneiderman over and over.

Trying to find and fix that manually by manipulating the text would be a nightmare.

Exhausting and error -prone.

A simple command -based instruction, like find and replace all, is way more accurate and efficient.

It just shows that no single interaction type is always superior.

Fourth, we have exploring.

This is where you move through virtual environments using your innate spatial knowledge.

This includes navigating 3D worlds, virtual campuses, or complex data visualizations like KV systems where scientists can literally step inside a data set.

And finally we get to responding.

This is the newest category where the system actually initiates the interaction.

It's proactive and context -aware and the user responds to its prompt.

Like when Netflix pauses to ask if you're still watching.

Or when Google Lens identifies a dog breed in a photo and immediately pops up relevant information.

The system is sensing your context and responding to it.

The challenge here though is a fine line between helpful and annoying.

A very fine line.

If the system constantly provides inaccurate or irrelevant responses, you just get frustrated.

Which highlights the need for transparency in its decision -making.

So if we pull back, what are the bigger forces in forming all this conceptual design work?

Well, conceptual design is heavily influenced by a hierarchy of higher level concepts.

Paradigms, visions, theories, models, and frameworks.

Let's start with paradigms.

Paradigms are the overarching adopted approaches.

The shared assumptions that define an era of design.

We saw the big shift from the WIM paradigm of the 1980s.

Windows, icons, menus, pointers, which was focused on a single user at a desktop.

That WIMP paradigm was revolutionary.

It replaced technical mastery with visual understanding.

And then in the 90s, the shift to the ubiquitous computing paradigm.

Or UbiComp, inspired by Mark Weiser's vision of technology just sort of disappearing into the environment.

UbiComp suggested that technology would become ambient, embedded in everyday objects.

That evolved into the modern big data and IoT paradigm of the 2000s, driven by sensors, machine learning, and automation.

The focus shifted again from the individual user to systems optimizing actions on our behalf.

Like smart traffic lights or home heating systems.

Exactly.

Then you have visions.

These are future scenarios that frame R &D goals.

Apple's 1987 Knowledge Navigator video is a great example.

It showed a tablet interface and a sophisticated speech -based assistant.

And it basically set a research trajectory for decades.

And the current pervasive vision is, of course, AI and automation.

Which naturally raises huge concerns about transparency and accountability.

The urgent need to explain why AI systems make the decisions they do.

We also see how science fiction inspires this, like the Star Trek holodeck, though it's often limited by the author's own cultural perspective.

And to bring all these concepts together, we use frameworks that offer concrete advice.

Don Norman's classic framework is probably the most foundational.

It has three pieces.

The designer's model, how the system is supposed to work.

The system image, how the system is actually presented through the interface and manuals.

And the user's model, how the user actually understands it.

So the goal of good design is to use that system image to bridge the gap between the designer's model and the user's model.

You've got it.

That's the whole game.

This entire deep dive into conceptualization, it seems to me, ultimately culminates in a critical dilemma for the future of interaction.

Which is?

Who is in control?

Yes.

This tension is inherent in the five interaction types we discussed.

Instructing gives the user maximum control, a feeling of mastery.

But responding interfaces, the proactive, context -aware ones, they take control away from you.

Right.

In favor of convenience, automation, or even safety.

Like systems that monitor an elderly person's movements.

The moment the control shifts, you see these dependencies emerge.

Think about GPS navigation.

You're in control when you instruct the destination.

But once you're on the road, the system takes over.

We become technologically dependent.

We often blindly follow the technology over our own common sense.

Totally.

The system's convenience often trumps our own critical thinking.

That is a critical psychological cost that we have to design for.

So, to quickly recap our deep dive.

Conceptual models are crucial high -level descriptions.

Metaphors aid understanding by leveraging what you already know.

And understanding the five interaction types.

Instructing, conversing, manipulating, exploring, and responding defines exactly how users will engage with your product.

That's it in a nutshell.

And here is a final provocative thought for you to mull over.

Given the explosive shift toward autonomous AI systems, the biggest challenge ahead for interaction design isn't just designing the momentary experience.

It's designing for the long -term user experience over months and years and on a societal scale.

And that requires methods and tools that the field is, frankly, still racing to invent.

A profound challenge indeed.

That was a deep dive into conceptualizing interaction.

Thank you so much for joining us.

ⓘ This audio and summary are simplified educational interpretations and are not a substitute for the original text.

Chapter SummaryWhat this audio overview covers

Establishing a solid conceptual model before building an interface ensures that design decisions are grounded in clear thinking and shared understanding across the team. This foundational work requires identifying and challenging underlying assumptions—factors taken for granted—and distinguishing them from claims that demand verification. When a design team reaches common ground, members operate from the same mental framework, reducing miscommunication and rework. A conceptual model functions as a high-level blueprint describing how a system works and what fundamental ideas users must understand to navigate it effectively. Within this model, interface metaphors serve as bridges between the familiar and the abstract, allowing users to apply real-world knowledge to digital tasks. The desktop metaphor, originating from the Xerox Star, established this approach, while contemporary designs like the card metaphor demonstrate its continuing relevance in mobile and social contexts. The chapter identifies five primary interaction patterns that shape design direction: instructing enables rapid execution of discrete commands suited to routine operations; conversing establishes a back-and-forth exchange where the system participates as an interactive partner, evident in voice assistants and dialogue systems; manipulating draws on physical intuition by letting users act directly on digital objects, forming the foundation of direct manipulation design; exploring allows users to navigate through simulated three-dimensional spaces or sensor-enhanced physical settings; and responding describes systems that take initiative by delivering contextually relevant information or notifications without explicit user prompting, as demonstrated by fitness trackers and image recognition tools. Interaction design draws from multiple complementary perspectives: paradigms represent prevailing design philosophies such as ubiquitous computing and the Internet of Things; visions outline aspirational future scenarios that guide research direction; theories provide empirically grounded explanations of human-computer interaction; models simplify particular aspects of user behavior; and frameworks like Norman's model of alignment between designer's model, system image, and user's model offer concrete guidance for analysis and creation. As artificial intelligence and context-aware systems grant increasing autonomy to technology, designers face critical questions about how much control and transparency users require to maintain meaningful interaction with their tools.

Using this chapter to study? Last Minute Lecture is free and student-run. If it helped, consider supporting the project.

Support LML ♥

Chapter 3: Conceptualizing Interaction

Related Chapters