Chapter 7: Critically Appraising Qualitative and Mixed Methods Evidence for Clinical Decision Making
Welcome to Last Minute Lecture.
This free chapter overview is designed to help students review and understand key concepts.
These summaries supplement not replace the original textbook and may not be redistributed or resold.
For complete coverage, always consult the official text.
Imagine you're an intensive care unit nurse during the absolute height of the COVID -19 pandemic.
Oh, wow.
Yeah.
A really intense environment.
Right.
And a researcher comes onto your ward, looks at the patient charts, and notes that a specific intervention caused blood pressure to drop by an average of like 10 points across 50 patients.
That data point is the what, you know?
It's precise, it's neat, and well, the medical field completely loves it.
They absolutely do.
But that number tells the researcher absolutely nothing about the sheer burnout you experienced during a 12 -hour shift or the immense fear the patients felt or just the resilience required to keep that war functioning.
Yeah, the numbers are blind to all of that.
Exactly.
To capture that reality like the how and the why of health care, those clean numbers are practically useless.
So welcome to our deep dive.
Consider this a special one -on -one tutoring session designed specifically for you, the college nursing or health sciences student listening right now.
We're so glad you're here with us.
Yeah.
Our mission today is to completely decode the
qualitative and mixed methods evidence.
We want to turn those really dense research concepts into confident, sound, clinical decision -making.
And to navigate that massive gap between numeric data and human reality, we really have to ground ourselves in the core philosophy of evidence -based practice.
Which is so much more than just numbers.
Right.
Exactly.
It's important to remember that CBP is not just about blindly following a randomized control trial.
I mean, true clinical excellence requires the successful integration of four distinct elements.
Okay, lay them out for us.
First, the best research evidence.
Second, your own clinical expertise.
Third, the patient's unique circumstances.
And fourth, their valued preferences.
That makes a lot of sense.
It's a whole picture.
Yeah.
And when you're looking at human experiences, qualitative research provides that best evidence part.
But to figure out if a specific qualitative study is actually rigorous enough to base your patient care on, you need a roadmap.
You can't just take their word for it.
No, definitely not.
Researchers and nurses use a structured tool called a rapid critical appraisal checklist.
Ah, the RCA checklist.
Exactly.
This tool is like the ultimate guide to systematically evaluating the validity, the results, and the clinical relevance of qualitative evidence before you ever apply it to a patient.
Okay, let's unpack this.
Because to use a rapid critical appraisal checklist effectively, you first need to understand the fundamental nature of the evidence you're looking at.
Right.
We know qualitative research is rooted deeply in the social and human sciences, right?
It's designed to answer those meaning questions.
The big why question.
Yeah.
But it major one we encounter is ethnography.
Yeah, ethnography, which is essentially the study of a social group's culture.
So like anthropology stuff.
Basically, yeah.
A researcher doesn't just send out a survey.
They embed themselves through field work and participant observation.
They're literally in the room.
Oh, yeah.
They look at physical artifacts.
They conduct interviews in the natural setting.
And this is key.
They have to carefully balance two perspectives.
Okay, what are they?
First is the emic view.
Emic with an M.
Yep.
That's the insider's perspective, people in the culture understand their own world.
And second is the etic view with a T, which is the researcher's outsider analytical perspective.
Oh, I see.
So balancing the insider experience with outsider analysis.
Exactly.
For example, a researcher might embed themselves in a specific community to study how different cultural beliefs about birth settings directly impact maternal outcomes.
That is so fascinating.
So ethnography is perfect if you want to understand existing culture.
But what if there isn't an established culture to observe?
What do you mean?
Well, what if you're looking at a completely unique human challenge and you actually need to build a brand new working theory from scratch?
Because a manual just doesn't exist yet.
Oh, I see where you're going.
How does qualitative research handle that?
That is where the second major tradition grounded theory comes in.
Grounded theory.
Yeah.
It was developed originally by
Glazer and Strauss.
The entire goal here is to generate a working theory directly from empirical data.
That sounds like how does that even work?
Think of it like a mechanic trying to fix a completely new type of engine without a manual.
Oh, I like that analogy.
Right.
They can't look up the theory in a book.
They have to build the manual by watching the engine run.
Researchers do this through a mechanism called constant comparison.
Wait, how does that actually work in practice, like step by step?
So they conduct an interview, pull out a concept and code it.
Then they conduct the next interview and immediately compare the new data back to the first interview.
Oh, so they don't wait until the very end to analyze everything.
No, not at all.
They are constantly shuffling and comparing data as it comes in.
They use what's called theoretical sampling, meaning they seek out specific participants who can test the emerging theory.
And they just keep doing that forever.
Well, no, they keep going until they reach saturation.
Saturation.
Okay.
Yeah, saturation just means no new information or themes are emerging from the interviews.
Like you're hearing the same things over and over.
Oh, that makes sense.
Ultimately, they're looking to identify a basic social process.
Give me a clinical example of that.
Sure.
A great clinical example is studying the psychological and logistical process by which mothers adapt to caring for their adult children who have just been diagnosed with schizophrenia.
Wow.
Yeah, there is no standard manual for that specific chronic grief and adaptation.
Exactly.
So the theory must be grounded in the raw data from those mothers.
That makes the distinction really clear.
Ethnography explores culture.
Grounded theory builds a framework from a social process.
You've got it.
But what if the clinical question isn't about a process at all?
What if it's just about the raw, unfiltered feeling of an experience?
Then you turn to the third tradition, which is phenomenology.
Yeah.
And that often includes hermeneutics.
Okay.
Phenomenology.
Big word.
Yeah.
But it's entirely focused on studying the lived experience or what philosophers refer to as the essences of a phenomenon.
Essences.
Like the core feeling.
Exactly.
It seeks to understand what an experience feels like for a person before they have even conceptualized or intellectualized it.
I want to push back on a specific concept used in phenomenology, though.
Okay.
Let's hear it.
The concept of bracketing.
Because the premise is that researchers have to suspend their own beliefs and previously acquired knowledge about a phenomenon before they study it.
Right.
Yes.
That's the idea.
It reminds me of a jury being instructed by a judge to completely wipe their mental whiteboard clean so they can view a trial without any prejudice.
That's a really good comparison.
But can human researchers truly achieve that?
Like, can you actually wipe your own brain clean of bias?
Well, that is a highly debated question in the field.
And it actually points to a major historical division.
Oh, really?
Between who?
Well, if you follow the philosophical rubes of Edmund Husserl,
his approach to phenomenology is purely descriptive.
He believed that, yes, through a disciplined process of introspection called phenomenological reduction,
you can and must peel back the layers of your preconceived notions.
So you just bracket out your biases to describe the pure essence of the experience?
That's Husserl's view, yeah.
But not everyone agrees with that.
Not at all.
Hermeneutics,
which stems from philosophers like Heidegger, takes a completely different stance.
What's Heidegger's take?
Hermeneutic phenomenology is interpretive and dialogical.
It argues that as a human being, you cannot completely strip away your own historical and social context.
It's just impossible.
Yeah, I tend to agree with that.
You are who you are.
Right.
So instead, you use your background to engage in a thoughtful dialogue with the data.
You explicitly acknowledge that your own interpretation is an unavoidable part of how the meaning is ultimately understood.
That is a fascinating philosophical split.
It really is.
Now, even understanding these three main traditions, if I look at two different ethnographies or two different grounded theory studies, they can still look completely different on the page.
Oh, absolutely.
Why is there so much variation within the exact same tradition?
Because of the internal diversity of qualitative research, the actual how -to of the methodology.
Okay, break that down for me.
This diversity stems from choices in representation, description, and data collection techniques.
Let's look at representation first, which is kind of the conceptual lens through which the research is written.
Give me an example of a lens.
Well, you might read a critical ethnography.
The lens there is focused heavily on emancipation from oppressive circumstances.
Okay, so a very specific goal.
Right.
Or you might read a study grounded in feminist epistemologies, where the primary lens is focused on investigating gender inequality and mobilizing research for social change.
So the lens the researcher chooses completely changes how the data is represented to the reader.
Exactly.
And that ties directly into how the data is described.
The concept of thick description by anthropologist Clifford Geertz comes to mind here.
Yes.
Thick description is the absolute lifeblood of qualitative work.
It's such a great phrase.
Thick description.
It is.
It means a researcher isn't just recording a thin factual account like just saying, a patient cried.
Because that doesn't tell you much.
Right.
They're capturing the textures, the environmental context, the complex social dynamics in the room, and the emotional weight of the moment.
So when a nurse reads a thick description of a patient's experience, they shouldn't just understand it intellectually.
No, they should feel the visceral reality of that patient's world.
But getting that level of detail requires specific tools, right?
A researcher can't just stand in a corner and hope a thick description writes itself.
Definitely not.
They use observation techniques like writing exhaustive field notes and analytic memos to themselves to track their own thoughts.
What about talking to people?
Oh, interviews are huge.
They can range from totally unstructured, open -ended conversations where the patient leads the way to highly structured formats.
Or even focus groups, right?
Yeah.
Focus groups are great to see how patients interact with each other.
And once they have all those field notes and interview transcripts, the analysis phase begins.
How do they actually extract the meaning from hundreds of pages of text?
They have a few options.
They might use narrative analysis, looking at the entire arc of a patient's story.
Okay.
They might use discourse analysis, dissecting the specific power dynamics in conversational language.
Or they might use content analysis,
where they systematically break down all that text into manageable codes and thematic categories.
Wow, that sounds like a lot of work.
It is.
But the success of all these techniques depends entirely on who you are actually studying.
Which brings us to sampling strategies.
Yes, sampling.
Because qualitative studies often use purposeful sampling, meaning they intentionally select people who have experienced the phenomenon, right?
Exactly.
They use snowball sampling, where one informant recruits others they know, and they use convenient sampling, just grabbing whoever is available.
Yeah, those are common.
Wait a minute.
Let me stop you right there.
In quantitative research, a randomized statistically representative sample is the absolute holy grail.
Isn't convenience sampling or just purposefully handpicking a tiny non -random group of people just fundamentally bad science?
I get why you'd think that.
Like, how can a nursing student trust a study of just 12 people?
What's fascinating here is how the definition of quality shifts entirely between the quantitative and qualitative paradigms.
Okay, how so?
In qualitative research, you do not want statistical representation.
You want rich, deep, complex data.
And this is governed by a brilliant concept called information power.
Information power?
Yes.
The principle of information power dictates that if your purposeful sample holds highly relevant, deeply textured information about the specific experience you are studying, you actually need fewer people to reach data saturation.
Oh, because you aren't trying to prove how many people in the general population experience a phenomenon.
Precisely.
You're trying to deeply understand the architecture of the phenomenon itself.
That makes total sense.
Think about it.
If you want to understand the extreme specificities of navigating the healthcare system as a double amputee veteran,
randomly sampling a thousand people who know absolutely nothing about that specific human experience would completely ruin the study.
You'd get terrible data.
Right.
You need the 10 people who actually live it every single day.
Yeah.
Okay.
The concept of information power completely reframes the small sample sizes for me.
But even with deep data, how does a student actually prove the study is rigorous enough to justify changing their nursing practice?
Because you still have to evaluate it.
Exactly.
We have to separate the rigorously conducted studies from the poorly executed ones.
We do.
And that brings us back to the rapid critical appraisal tools.
Yeah.
Specifically utilizing general criteria for qualitative appraisal.
And what's the gold standard there?
The undisputed gold standard is Lincoln and Guba's trustworthiness criteria.
Okay.
Lincoln and Guba.
Yeah.
These criteria are essentially the qualitative mirrors to the quantitative standards of validity and reliability.
Let's walk through those parallels because understanding the mechanism behind them is so crucial for our listener.
All right.
The first criterion is credibility, which parallels internal validity in a quantitative study.
So you're basically asking,
is this data accurate?
Exactly.
To prove credibility, researchers use mechanisms like negative case analysis, where they actively search their data for outliers that contradict their emerging theory.
Just to make sure they aren't ignoring inconvenient facts.
Right.
They also use triangulation, collecting data from multiple different sources to cross -verify.
And most importantly, they use member checks.
Oh, I love the mechanism of member checks.
It's so cool, isn't it?
Yeah.
The researcher literally takes their final written analysis back to the actual patients they interviewed and asks, hey, did I capture your experience accurately?
Yes.
And if the patient reads it and says, no, that's not what I meant at all, the researcher has to revise it.
It's a massive safeguard against the researcher projecting their own bias onto the data.
It is absolutely essential for credibility.
Okay.
What's next?
The second criterion is transferability,
which parallels external validity.
Meaning, can these findings be applied elsewhere?
Exactly.
And this relies heavily on that thick description we discussed earlier.
Oh, because the context matters.
Right.
The researcher must provide so much rich contextual information about the setting and the participants that you, the clinician reading the study in a
That is so practical.
And the third criterion.
Dependability, which parallels reliability.
But a reliable quantitative study can be perfectly replicated.
You can't replicate a human experience.
Right.
You can't.
So in qualitative work, dependability requires an audit trail.
An audit trail, like a paper trail.
Exactly.
The researcher must leave a meticulously documented paper trail of every single methodological decision they made.
That way, an outside auditor could review the process and agree that the conclusions are logical.
That's a great safeguard.
And finally, there's confirmability, which parallels objectivity.
Making sure the findings are genuinely grounded in the raw data.
Yes.
The researcher must clearly link their final interpretations directly to the raw quotes from the participants.
The appraisal criteria also touch on authenticity, right?
Specifically using a fantastic term,
verisimilitude.
Verisimilitude, yes.
It essentially means the writing is so potent and true to life that it draws the reader vicariously into the experience.
It feels real.
Yeah.
So what does this all mean for you, the student trying to appraise evidence?
It means you absolutely should not look for a p -value to prove a qualitative study is good.
No, you definitely won't find one.
Right.
Instead, you must look at the methodology and ask, did they do member checks?
Is the audit trail clear?
Is the thick description rich enough to help me care for my patients?
You hit on the exact reason p -values are useless here.
If you're searching for statistical significance in a patient's interview transcript, you're missing the point entirely.
Entirely.
And before we bridge into the next topic, it's really worth understanding metasynthesis.
Okay.
What is that?
You know that a quantitative meta -analysis mathematically crunches numbers from dozens of studies to find a definitive statistical trend, right?
We see those all the time.
Well, a metasynthesis is the qualitative equivalent.
It's the holistic translation of multiple qualitative studies to create brand new knowledge.
How does that look in practice, though?
Imagine taking a dozen different qualitative studies from around the world about parents trying to access specialized health care for children with autism.
By synthesizing all those thick descriptions,
researchers build a comprehensive, overarching body of evidence about the shared systemic barriers those parents face.
A single small study could never achieve on its own.
Exactly.
Which is an incredible segue.
Because qualitative research provides the rich how and why, while quantitative research provides the precise what.
Right.
But what happens when a clinical question is so complex and so multi -layered that it demands both?
Enter the world of mixed methods evidence.
Yes.
This is where clinical research becomes truly dynamic.
It really is.
But we must strictly differentiate here.
Mixed methods is not the same thing as multiple methods.
What's the difference?
Multiple methods is just running a quantitative survey and a qualitative interview in the same study to answer two different questions.
So they don't interact.
Right.
The data sets don't interact at all.
True mixed methods involves complementary engagement.
The methods work together to address the exact same research question, resulting in synergistic meta -inferences.
Okay, synergistic meta -inferences is quite the academic mouthful.
It is a bit jargon heavy, I admit.
You basically mean that the sum is greater than the parts.
Right.
Yes.
Like the final clinical conclusion is a totally new insight that you could never have reached using just the numbers or just the interviews alone.
Exactly.
They interact to create a higher level understanding.
The fork and knife analogy is the perfect way to visualize this.
Oh, I love that analogy.
Right.
You can use a fork alone to eat a salad.
That's your quantitative study.
You can use a knife alone to butter a piece of bread.
That's your qualitative study.
But if you're served a complex meal, like a thick steak, you absolutely need both tools.
You do.
And you don't just use them randomly.
There's a highly choreographed ritual to using them together.
You have to know exactly when to cut with the knife, how to hold the fork, and when to stab.
Mixed methods is that choreographed ritual.
That is so spot on.
If we connect this to the bigger picture of clinical practice, think about a routine patient assessment.
Okay, let's hear it.
Assessing a patient's respiratory rate at 10 breaths per minute is a hard quantitative metric.
Yep, pure numbers.
But on its own, a number doesn't tell you the patient's level of distress.
It requires the qualitative assessment of whether the patient is actually experiencing dyspnea, the subjective, terrifying feeling of not being able to catch their breath.
Oh, wow.
Merging the quantitative rate with the qualitative experience gives you the comprehensive clinical picture.
If mixed methods is this choreographed ritual with the fork and the knife, how does a researcher actually decide the sequence of steps?
The clinical research question itself dictates the choreography.
Researchers use three primary mixed methods designs and they use a very specific visual notation to explain the priority of the methods.
Strands, right?
Yes, they use strands.
An uppercase strand like all caps, Q -U -A -N or Q -U -A -L means it is the priority method driving the study.
A lowercase strand means it's the secondary supporting method.
Let's walk through the mechanics of the three primary designs.
Okay, the first is the exploratory sequential design.
The notation here is uppercase Q -U -A -L and arrow, then lowercase Q -U -A -N.
So Q -U -L to quan.
Right.
You use the sequence when investigating completely unfamiliar subjects where no existing theory exists.
You have to explore the landscape first.
I have to admit that seems completely backwards to me.
Really?
Why?
Why would you do the qualitative interviews first and the hard statistics second?
Wouldn't you want to run the numbers first just to know what broad trends you're even looking at?
It seems counterintuitive, yeah.
Until you realize that in an unknown phenomenon, you don't even know what variables to matter yet.
Let's go back to the first wave of COVID -19.
If you wanted to study safety incidents among hospital staff, you couldn't send out a quantitative survey because you didn't know what the specific safety factors were.
Because nobody had ever experienced it before.
Exactly.
First, you have to interview the staff to figure out what is actually happening on the floor.
That's your qualitative priority strand.
And then?
Then, once you've identified the factors through those interviews, you can design a target statistical survey to measure the relationships between those newly discovered factors.
That's your secondary quantitative strand.
The exploration must precede the measurement.
Okay, that makes perfect sense.
You literally can't measure what you haven't identified.
What is the second design?
The expandatory sequential design.
The notation reverses.
Uppercase Q -U -A -N, an arrow, then lowercase Q -U -A -L.
Q -U -A -N to qual.
Yes.
You use this when you already have a body of quantitative evidence, but you seek deeper meaning behind the numbers.
Give me an example of that.
Sure.
Like investigating nursing student burnout.
First, you deploy a validated measurement tool to a thousand students to find the exact prevalence and statistical severity of burnout syndrome.
That is your priority Q -U -N -N strand.
Okay, so you have the broad data.
Right.
Then you purposely select the specific students who scored the absolute highest for burnout and you interview them to understand their personal lived perception of it.
That's your secondary qual strand.
Here's where it gets really interesting.
The cause and effect logic of the explanatory design is flawless.
You literally cannot effectively interview students about their severe burnout, the qualitative piece, until you first use quantitative tools to accurately identify who actually has severe burnout.
Exactly.
The quantitative data is the necessary prerequisite for the qualitative deep dive.
You've nailed it.
And the third design.
The third is the convergent parallel design.
This is simultaneous.
So both at the same time.
Yep.
You run the quantitative and qualitative strands at the exact same time, independently gathering concurrent multiple perspectives of the same phenomenon.
And then what?
Once data collection is finished, you merge the two results to see how they compare and contrast.
So just like we use the trustworthiness criteria to appraise qualitative studies, we need a specific critical appraisal tool to ensure this complex mixed methods fork and knife routine was executed properly.
We do.
When you're appraising a mixed methods study, you're focusing your evaluation on three key pillars.
What's the first pillar?
The first pillar is, are the strands independently rigorous?
Remember, a mixed methods study is an amalgamation.
If the qualitative phase lacks credibility, or the quantitative phase lacks statistical validity on its own, mixing them together won't magically save the study.
Right.
Mixing contaminated water with clean water just gives you a larger bucket of contaminated water.
Exactly.
What a great visual.
What is the second appraisal pillar?
Effective integration.
Did the researchers genuinely mix the methods or just staple two separate studies together?
How do you actually mix them?
This integration happens in a few specific ways.
Phase connection is when the results of phase one literally dictate the data collection of phase two, like we saw in the sequential design.
Results comparison involves directly contrasting the final result of both independent phases against each other.
And data assimilation is the complex process of transforming qualitative themes into numerical codes, merging everything into one massive data set.
And that brings us to the third appraisal pillar, which might be the most intellectually challenging, addressing inconsistencies.
Yes, the inconsistencies.
What happens if the qualitative interviews completely clash with the quantitative survey numbers?
The Rubik's Cube analogy in the text is brilliant for this.
It really helps visualize it.
Imagine two people looking at opposite sides of a Rubik's Cube.
One person looks at their side and says, this entire cube is red.
The other person looks their side and says, no, the data clearly shows this cube is yellow.
And because of their specific methodological vantage point, they're both technically right.
Exactly.
A poorly executed mixed method study just ignores the yellow data because it complicates the conclusion.
Which is bad science.
Very bad science.
But a rigorously conducted mixed method study embraces the clash.
It explains the multidimensional nature of the Rubik's Cube.
The researchers must explicitly address the conflict.
Yes, they might reconcile it by finding a deeper theory.
They might initiate brand new research questions to explain the discrepancy.
They might bracket the context.
Or they might justifiably exclude flawed data.
But they absolutely must explain how they handle the contradiction.
This raises an important question regarding how we grade the overall strength of a mixed method study.
Oh, how do we grade it?
Because of the immense complexity involved in mixing paradigms, the rule of thumb for appraisal is incredibly strict.
A mixed method study cannot be rated as high quality overall if either of its individual strands is deemed medium or low quality.
Wow, really?
Yes.
A chain is truly only as strong as its weakest link.
If you have a brilliantly executed quantitative strand but a sloppy qualitative strand, your overall mixed methods evidence grade drops to go.
They hold these mixed studies to an incredibly high standard, which is exactly what we want when making life and death clinical decisions.
We wouldn't want it any other way.
Okay, let's look back at the logical journey we've taken today.
We started by mapping out the unique methodologies of qualitative research understanding culture through ethnography, building processes with grounded theory, and exploring lived experiences through phenomenology.
We learned how the evidence -based practice process appraises this textual evidence, swapping out mathematical p -values for trustworthiness criteria like credibility and dependability.
Yes.
Then we brought it all together with mixed methods, showing how the strategic choreographed integration of both qualitative and quantitative data leads to comprehensive clinical decision -making.
And ultimately, that sound decision -making supports practice change and vastly better patient outcomes.
Which is the whole goal.
Exactly.
As we wrap up this deep dive, I want to leave you, our listener, with a final thought to mull over as you head into your clinical rotations.
Let's hear it.
Qualitative and mixed methods research requires us, as scientists and caregivers, to bravely embrace the gray areas of human existence.
In a medical world that is heavily obsessed with objective certainty, rigid algorithms, and clean data points, how might your own nursing practice change tomorrow if you actively searched for the thick descriptions hiding behind your patient's numeric vital signs?
That is a deeply powerful question to carry with you onto the ward.
To our student listening, congratulations on mastering the complexities of qualitative and mixed methods appraisal.
You've done amazing work today.
You really have.
You now have the tools to evaluate the how and the why of patient care, not just the what.
Thanks for diving deep with us today on this special edition from the Last Minute Lecture team.
You've got this.
ⓘ This audio and summary are simplified educational interpretations and are not a substitute for the original text.
Using this chapter to study? Last Minute Lecture is free and student-run. If it helped, consider supporting the project.
Support LML ♥Related Chapters
- Critically Appraising Quantitative Evidence for Clinical Decision MakingEvidence-Based Practice in Nursing & Healthcare: A Guide to Best Practice
- Evaluating Clinical EvidenceBates' Guide To Physical Examination and History Taking
- Evidence-Based Clinical Practice GuidelinesAdvanced Health Assessment & Clinical Diagnosis in Primary Care
- Evidence-Based Practice Mentors: The Key to Sustaining Evidence-Based Practice in Clinical and Educational SettingsEvidence-Based Practice in Nursing & Healthcare: A Guide to Best Practice
- Finding Relevant Evidence to Answer Clinical QuestionsEvidence-Based Practice in Nursing & Healthcare: A Guide to Best Practice
- Implementation Science to Clinical Practice Settings: Accelerating the Uptake of Evidence Into Practice for Best OutcomesEvidence-Based Practice in Nursing & Healthcare: A Guide to Best Practice