Midterm Study Notes

 

An operational definition of a quantity is a specific process whereby it is measured. For example, the weight of an object can be operationally defined by using a balance and standard weights.

 

What is evaluation?

The systematic determination of the merit, worth, value, significance of something (standards and performance data) Program, Policy, Performance, Product, Process, and Personnel

 

n G&U: social work research is “a systematic and objective inquiry that utilizes the research method to solve human problems and creates new knowledge that is generally applicable, (p. 19).”

 

RESEARCHERS must:

 

BE ABLE TO SUMMARIZE SOME OF THE KEY DIFFERENCES BETWEEN APPLIED AND BASIC RESEARCH—POSSIBLE ESSAY QUESTION.

 

Basic: goal—to develop theory and expand knowledge base.

Applied: goal—to develop solutions for problems and applications in practice.

 

Applied and basic are more similar than different. They are similar in theory, methodology, and ethics. The difference between the two is in degree

 

 

Applied Research

Basic Research

PURPOSE

Knowledge Type

Knowledge Use: improve understanding of a problem to contribute to the solution of a problem--Is it making a difference?

Knowledge Production: to expand knowledge (id universal principles that contribute to understanding of how the world operates. Eg. NASA research.

Question Type

Broad in Scope: complex 'fuzzy' issues, multiple broad research questions, research in messy uncontrolled environment.

Narrow in Scope: specific topic, fundamental issues, tightly focused question. Eg. What is the effect of cocaine use on fine motor coordination? In controlled environment (laboratory), reduce measurement error or eliminating noise.

Significance

Practical & Statistical Significance: Q. if effects are of significant size to be meaningful. Level of outcome to audience and interest groups. Eg. New drug showed statistical significance but not practical significance--not enough change for the individual.

Statistical Significance: Q. if causal relationships exist and statistical significance.

Theory

Opportunism: use theory instrumentally by identifying variables and concepts that will likely produce practical results. Which theory will be useful? Does theory help solve the problem? Will combine theories.

Purity: Underlying theory is critical. Controlled environment. Researcher controls variables to represent the theoretical constructs. Eg. study deals with only anger and not frustration, boredom, fatigue, etc.

CONTEXT

Environment

Open: diverse environments, permission to obtain access to the data. Limited by resources and political/bureaucratic barriers and time

Often conducted in universities and academic environments. Laboratories

Initiator of Research

Client Initiated: research question often from client and they are often poorly framed on completely understood. Client in control; much negotiation scope, cost, time frame, etc. Trade-offs.

Researcher Initiated: The idea for the study, approach to executing it. More flexibility in question and design

No. of Researchers

Multidisciplinary research teams; often include community collaboration

Individual Researcher: autonomous, sets scope and approach, usually smaller teams (if more than 1 researcher), less collaboration

Stakeholders

Multiple stakeholders

Few stakeholders

Results

Results & Publishing negotiated

More free to use the results immediately

 

 

 

METHODS

Validity

External Validity Emphasized: the extent to which the study results are generalizable.

Internal Validity Emphasized: the extent to which a causal relationship can be soundly established. Both validities are important in both types of research.

Construct

Construct of Effect: Is it/are you credible? Valid outcome measures, accurately measure variable of interest. Multiple outcomes and multiple measures to assess construct.

Construct of Cause--Cause and Effect: the independent variable must be clearly explicated and not confounded with any other variables.

Levels of Analysis

Multiple Levels of Analysis: specific problem at more than one level of analysis (individual, group, organization, society); use multiple research methods and triangulation. Quasi-experimental design.

Single Level of Analysis: Multiple levels of analysis not as needed b/c of the control on the other variables. Experimental design.

Commitment to Research Design

Iterative design

Research conducted exactly as designed

 

KNOW THE DIFFERENCE BETWEEN THE TWO APPROACHES. POSSIBLE ESSAY QUESTION.

 

Approaches:

 

Quantitative: relies on quantification (or putting into numbers) in collecting and analyzing data and uses statistical and inferential statistics. (more in chapter 5)

 

Qualitative: descriptive methods of data collection. Data in the form of words, diagrams, drawings, observations, prime example is ethnography. (more in chapter 6)

 

Three Contextual Factors that Shape Social Work Research Studies

  1. Social Service Program (private sector, public social service setting) (a) accountability (b) all research has evaluative potential (c) accountability creates market for research (d) programs exist in hostile environments (e) programs have scarce financial resources (f) programs have client files
  2. Social work profession (a) professional values and ethics (b) profession’s beliefs and practices, and (c) the rewards for doing research
  3. Social workers themselves (a) social workers are people oriented (b) social workers have a vested interest in practice (c) social workers need research

 

KNOW AND BE ABLE TO DISCUSS:

Researchers enemies: bias, intervening (nuisance) variables, and chance (random error).

 

BE ABLE TO DISCUSS SOME OF THESE ETHICAL CONCERNS

 

The 7 main issues that social workers must be concerned about during the actual research project or activity:

  1. Ethical Aspects of Research Designs: (use random assignment, do not withdrawal or reintroduction of an intervention—consultation!!!)
  2. Use of Deception: (clients get vague info on intervention)
  3. Privacy, Confidentiality, Anonymity: Privacy: persons’ interest in controlling the access of others to themselves; Confidentiality: extension of privacy, agreement between researcher and participant on how data are to be handled in keeping with the subjects; interest in controlling the access of others to info about themselves; Anonymity: the names and other unique identifiers of subjects are never attached to the data or know to the researcher. (protection of data, individual’s privacy, plan if participants become upset during data collection or other research activities).

As social workers, when are you obligated by NASW and Michigan Law to breech confidentiality anytime, inside and outside of work? Suspected Homicide, Suicide, Child Abuse

  1. Conflicts of Interest: (participants are or were clients, results affect you directly or indirectly)
  2. Reporting of Results: (protect confidentiality and report accurately, protect clients from harm; obligated to be honest and accurate)
  3. Disclosure of Results to Research Participants: (generally share research data and results with clients, colleagues, or public; may hold results if it is to protect participants. Determine “right to know” audience.
  4. Acknowledgment of Credit: (collaborators, contributors—conversations, presentations, conferences, classes, manuals, web resources, program materials, published and unpublished references, other forms of media (radio, television) (When find a resource write down citation right away: name of author, publisher name, publication date, publisher location, name of book/article, write down date you got it and website when applicable; keep copy for your records and resources)). 

http://owl.english.purdue.edu/handouts/research/r_apa.html and http://webster.commnet.edu/apa/index.htm are good Internet sources for APA citations.

 

Structure in Research Interviewing

A)    Structured Interviews: exact directions and sequencing of the interview. Open and close ended questions.

 

B)     Semistructured (focused) Interviews: (selected topics and hypotheses but specific items are not entirely predetermined. Requires skilled, trained interviewer. (What is it that is to be learned and how much is already known about it? To what extent are the interviewers trained, prepared, and able to elicit data on their own from their research participants (interviewees)? To what extent is the simplicity of coding responses (implications for validity) to be a determining factor?

 

C)    Unstructured Interviews: Only general problem area determined in advance. Freedom to discuss wider range and depth. Usually neutral questions. Difficult to code and analyze.

 

Again, the purpose of a lit review is to:

¨      Provide you with existing theory

¨      Develop a justification for your study (how your work will address a need or contribute to an unanswered question)

¨      Inform your decisions about methods, alternative approaches, or potential problems with your plan

¨      Be a source of data to test or modify your theories

¨      Help you in generating your own theory

 

Random Sampling: means the sampling method is free of human judgment, population has equal opportunity of being selected for the sample.

 

Longitudinal Designs

 

BE ABLE TO DESCRIBE SOME OF THE PROS AND CONS OF GROUP-ADMINISTERED SRUVEYS, MAIL SURVEYS, AND TELEPHONE SURVEYS.

Group-Administered Surveys

 

Mail Surveys

 

Low Response Rates

 

BE ABLE TO NAME OR LIST AND DISCUSS SOME OF THESE TIPS

Tips

¨      Good cover letter. In simple language, write to the audience, Lots of white space and as short as possible

¨      Date of mailing of questionnaire

¨      Identify you and institution,

¨      Give brief synopsis of the purpose of the study

¨      Give potential benefits to policy, practice, and the participant

¨      Explain the importance of the participants participation

¨      Slightly over-estimate the time to take the question (thru pilot testing)

¨      Tell how info will be confidential and how it will be used

¨      Explain how to return the form

¨      Give contact person for questions or comments about the survey

 

Costs

¨      Minimize cost maximize clarity, appearance and readability.

¨      Stamped-addressed envelopes, business reply mail, postcards

 

 

Confidentiality thru participant coding system. Id # on the survey. Anonymous by no identifying information the survey (don’t know who returned survey and who didn’t)

 

If the survey is confidential, send a follow-up letter and another copy of the survey to the participants that haven’t returned the survey. Also, telephone reminders, but not necessarily offer the survey over the phone, that would change the nature of your data collection methods and would introduce a potential bias in the responses.

 

 

Telephone Surveys

¨      What are the pros and cons of a telephone survey compared to interviewing and mail surveys?

¨      Have many of the benefits of face-to-face interviewing but quicker to implement and less expensive than face-to-face, no transportation involved, local calls are inexpensive, no printing expenses. All things considered most telephone surveys end up being more expensive than mail surveys though.

¨      But allows for interviewer bias, no visual contact, rapport-building is unlikely, the surveys must usually be shorter or respondents may fatigue.

¨      Sometimes cover letters and the survey are mailed first to be used as a reference for the interviewee to use during the phone survey.

¨      Proportion of people without telephones (poor, unlisted numbers, cell phone exclusively, in rural areas). Even addresses (homeless, precariously housed, doubled occupancy). Be aware of the likelihood for measurement error (construct validity: how instrument measure the theoretical construct, i.e., how well does your survey on depression measure depression and not anxiety, fatigue, boredom, frustration, etc.)

 

Sampling in Phone Surveys

¨      Random Digit Dialers (RDD). Not totally random because it sometimes goes by phone prefixes (269). I live in Kzoo, but I still have a Conn. number and don’t plan to change it. Toll free lines. Caller ID. Not appropriate for surveying students from KPS graduating class of 1995 by using a list of names of the graduates and the (269) area code, (b/c many moved and have totally new addresses and area codes; your sample would only be representative of those that remained in Kzoo).

¨      Some surveys will have a machine or recorded voice tell you to call a number to complete a survey. This is not survey research as the results are not generalizable.

 

 

Computerized Data Collection

¨      Coding is the translating of data to a form that is readable to a computer. For recoding and analyzing data (Microsoft Excel; also Statistical Package for the Social Sciences (SPSS), SAS, Microsoft Access, among others). Can also include audio and video data.

¨      The more steps to coding the more likely data errors will occur (misspelling changes the mean, adapting response but gets the wrong intended meaning, put data into wrong cell, misses or adds twice, types in numbers incorrectly, limited level of computer experience for the person who performs the coding, etc.

¨      Problems with computer or email surveys: access and ownership of computers (socioeconomics, ethnicity—digital divide leads to problems of the representativeness of your sample. Who is it representative of? (people who have enough access and background in computers to be able to answer the survey, and the question of who is actually responding? Also, likely to go right to the junk mail file as an unrecognized email address. Hard to convince people of the legitimacy of the project through email.

 

A positivist’s assumptions

 

 

 

A quasi-experiment: no random selection or assignment of participants to groups. treatment is applied to naturally occurring groups that may differ for various reasons other than the particular treatment. And uses classification variables; researcher must establish that the groups don’t differ on the accidental characteristics that may affect the final score. Quasi-experimental design (not randomly assigned but use a similar comparison group with which to compare results). Carefully matching the tx and control groups, aids in the elimination of rival explanations.

 

 

Is age measureable? Years. Months. Is depression measureable?

 

BE ABLE TO IDENTIFY AN INDEPENDENT AND DEPENDENT VARIABLE IN A RESEARCH QUESTION AND HYPOTHESIS

Independent variable is age—it affects the dependent variable. Dependent variable is depression—it is the one affected.

 

 

Nuisance variables: factors that influence the value of the dependent variable other than the treatment of interest. (Sleep, IQ, socioeconomic status, time of day of the test, room temperature extremes, etc.

 

depression affect ethnicity? Having depression could never influence if you where Caucasian or Asian. Restated: What affect does ethnicity (independent variable) have on the likelihood of depression (dependent variable)? Not how does depression affect ethnicity.

 

Good hypotheses have…

 

BE ABLE TO DISCUSS SOME OF THE METHODS FOR INFERRING CAUSATION. POSSIBLE ESSAY QUESTION.

Eight Strategies for Inferring Causation

  1. Ask the observers. What experiences in undergrad were the most important? What elements from your undergrad in social work are relevant to you today? What has improved your sadness and depression? (Asks the people directly affected and ask the people who observed the effects on the participants).
  2.  Check whether content matches outcome: If alcohol reduction program is truly reducing relapses in drinking, then we expect that alcoholics that avoid relapsing used these strategies from treatment not from prior knowledge or from other sources or programs. Look for strategies not learned (counterexamples). (Edison, 2000 ways to learn how not to make a light bulb).
  3. Other patterns: modus operandi. Detective searching for clues. OR start with the suspect and trace down the causal chain to see what impacts. Every time evidence is consistent with the expected “trace” left by the suspect, confidence is gained that the suspect is increased. On the other hand, if evidence is contradictory, it reduces that causal chain (or suspect). Missing evidence makes the explanation more doubtful. In empirical studies what almost 100% confidence that you’re sample is representative of the population and there is no error. In evaluation, beyond a reasonable doubt.
  4. Check whether the timing of outcomes makes sense. At the same time or after whatever caused it? Distal outcomes (far downstream in causal chain)—(a) Is it too early to expect change? Is it unrealistically quick? The expect time for change may be in the relevant literature. (b) is the timing of the outcome better or logically attributable to other causes? (c) Outcome do not occur out of sequence (in health promotion program: blood pressure drops prior to changes in eating and exercise).
  5. Check dose is related to response. (the higher the dose, the greater the response…to a point (point of diminishing return or ceiling effect)) Is the magnitude of change logical for the duration and intensiveness of the treatment? Better to give multiple doses and make multiple response observations; also different contexts.
  6. Make Comparisons with a control or comparison group. Experimental (control grp.) or quasi-experimental design (comparison grp). Sample sizes must be large enough.
  7. Control Statistically for extraneous variables. If you were examining a new teaching technique for math instruction, even if you had a control group, you might want to be sure that prior aptitude was not causing your results to look better or worse.
  8. Identify and check the underlying causal mechanism. How do we know cigarette smoking may increase the likelihood for lung cancer? How do we know they’re not just correlated (or just share some type of relationship but one’s not causing the other)? Researchers identified several substances known to be carcinogenic in cigarette smoke (thru experimental design); so it’s harder to argue that there’s not some type of causal relationship.

 

 

Qualitative Research

 

 

 

Phases

1.      Problem Identification: move from a part to a whole; id key concepts and loosely defined variables

2.      Question Formulation: general research question or working hypothesis

3.      Research Design: ethnography, goal-free evaluation

4.      Collecting Data: All data processed is done so thru the researcher(s). You are a tool in data collection. Principles in data collection: (1) be aware of your own biases—your notes are sources of data (2) participants tell you their stories and you tell them your understanding or interpretation of their stories (3) data collection has multiple sources and multiple methods (triangulation) (limitations of triangulation—resources, and measurement instruments that aren’t comparable—commensurate measure: measured on the same scale).

5.      Analyzing Data: iterative

6.      Interpreting Data: iterative

7.      Presentation of Findings: generally lengthier than quant reports

8.      Dissemination of Findings

 

Principles of Observation

 

 

BE ABLE TO DEFINE REACTIVITY AND GIVE AN EXAMPLE OR RECONGNIZE AND EXAMPLE OF REACTIVITY. POSSIBLE ESSAY QUESTION.

 

 

Participant Observation

 

Why measure things:

 

UNDERSTAND AND BE ABLE TO DEFINE VALIDITY AND RELIABILITY

Measurement Validity and Reliability                      

 

 

   Target E.g.

 (1) (upper left) no reliability, no validity; (2) (upper right) reliability, no validity;

       (3) (bottom) reliability and validity

 

Validity

Measures what it is intended to measure. The extent to which answers correspond to some hypothetical “true value” of what we are trying to describe or measure. The score that you get reflect the true differences not error in measurement. (1) Does the instrument measure the variable? If so, how accurately?

 

 

Types of Validity (Content, Criterion, Construct)

 

1. Content Validity: the extend to which the content of a measuring instrument reflects the concept that is being measured and in fact measures that concept and not another. All variables have operational definitions. Data gathered must be directly relevant to these variables. Logical sample of questions from the universe of questions.

 

 

2. Criterion Validity: the scores obtained on a measuring instrument are comparable with scores from an external criterion believed to measure the same concept. [The Beck Depression Inventory (BDI) and The Hamilton Rating Scale for Depression (HAM-D)]. Mile and Kilometer.

 

 

3. Construct Validity: instrument successfully measures a theoretical construct; the degree to which explanatory concepts account for variance in the scores of an instrument. Examines concepts (unmeasurable ideas). What is the instrument measuring, how and why it operates the way it does.

 

Convergent Validity: the degree to which different measures of a construct yield similar results, or converge. Evidence from different sources and collected in different ways leads to the same or similar measure of the concept; and if given to people in two different states, it should yield similar results in both groups. (Two different way for measuring opinions on George Dubya—opinion mail survey and email correspondence from within the White House.

 

Discriminant Validity: a concept can be empirically differentiated from other concepts. Prior research or statistical procedures have demonstrated that a difference that can be measured exists separating anger from frustration, bereavement from clinical depression.

 

Examine and weigh the various approaches to instrument validation and ask yourself:

A)                How well does this instrument measure what it should measure?

B)                 How well does this instrument compare with one or more external criteria that claims to measure the same thing

C)                What does this instrument mean? What is it in fact measuring? How and why does it operate the way it does?

 

BE ABLE TO DEFINE AND DISCUSS SOURCES OF MEASUREMENT ERROR.

Sources of Measurement Error

Measurement Error: variation in responses on a measurement instrument that cannot be attributed to the variable being measured. A goal is to minimize error.

 

BE ABLE TO DEFINE AND DISCUSS SOURCES OF RANDOM ERROR.

 

Random Error

o       Unknown or uncontrolled factors affecting the variable being measured and the process of measurement in an inconsistent fashion. Over and under-estimate the true differences.

o       The larger the sample the more random error cancels each other out and scores tend toward the population mean.

o       Types of random error

o       Transient qualities of the respondent (vary day to day, moment to moment)

o       Situational factors in measurement (seating arrangement, work space, noise, lighting, presence of recording device, social setting)

o       Factors related to the administration of the instrument (uniformity of instrument applications—add or omit material, change wording of questions or instructions, different criteria or info to classify behaviors). Standardization helps minimize subjectivity

o       Administrators demeanor, appearance, demographics can affect how an individual responds. Build rapport (create interest, cooperation, time getting acquainted, increase motivation, reduce anxiety, make sure are capable of completing tasks. Environment conducive to type of response, clear standardized instructions, and trial runs of instrument.

o       Other types of error specifically in surveys:

o       Sample selection bias: sample from an incomplete list (out of date); sample from wrong place (E.g., why people may or may not use city park and sample from people at the park—should sample from those not using park. Use random sampling to draw from the population you want to study!

o       Respondents are usually biased in some way, to the best solution is as high a response rate as possible. With a high response rate, the non-responders would have to be very different from the responders to affect your overall estimates.

o       Item nonresponse error: failure of respondents to answer individual questions—blank questions, accidentally skipped items, do not follow instructions, write marginal comments that can’t be equated with printed categories.

o       Response error: respondents misunderstand the wording. Make all respondents understand the items in the same way and can provide answers for every item (mutually exclusive response options). Make items Clear and do not go beyond what is reasonable to expect people to know or remember

 

What is validity? Measurement instrument actually measures what it intends and does so accurately. What were the three main types? Content, Criterion, and Construct. Must ask yourself what is it valid for and for whom?

 

Content Validity in Standardized Measurement Instrument

  1. Each item must represent an aspect of the variable being measured
  2. Questions empirically related to the construct being measured?
  3. The instrument should discriminate between individuals at low and high extremes, and middle
  4. Double-barrel questions or vague interpretations should be avoided
  5. Some questions should be worded positive and others negatively (yes half, no other half) Alternating positive and negative wording for questions breaks up social desirability response set
  6. Questions should be short
  7. Avoid negative questions
  8. Avoid biased questions (derogatory statements, slang terms, and prejudicial or leading questions)

 

Criterion Validity

What is Criterion Validity? Can anyone come up with examples? Process of comparing scores on a measurement instrument with an external criterion. E.g., A) schools grades, credits. B) Contrasting groups or groups that are assumed to be different. C) psychiatric diagnoses D) other instruments E) different observers

 

Construct Validity

What is construct validity? Degree to which an instrument measures a theoretical construct or an unobservable characteristic or trait. A) predict developmental changes B) use other measurement instruments with proven construct validity to validate new instruments—should correlate with old instruments C) convergent-discriminant validation: Convergent Validity: the degree to which different measures of a construct yield similar results, or converge. Evidence from different sources and collected in different ways leads to the same or similar measure of the concept; and if given to people in two different states, it should yield similar results in both groups. Discriminant Validity: a concept can be empirically differentiated from other concepts. D) pretest posttest

 


Construction of Standardized Instruments

 

Question Selection

To enhance content validity…

A) Rational-Intuitive method: choose questions in a logical manner.

B) Empirical method: statistics used to select questions.

 

Response Category Selection

Possible response options devised (Likert scale)

 

Number of Categories

How many response categories? What is the problem with to much variance? And too little?

A)    Large enough to allow for some variance but small enough so that discrimination between levels can be made.

B)     Odd or Even number of response categories

The response-value continuum: decisions about how respondents should be rated to frequencies or to agree-disagree dichotomies. Yes/no then scale question.

 

Determination of Instrument Length

Typically, the longer the instrument, the greater the reliability. However, the more difficult to administer and more likely that the respondents will satisfice

 

“Satisfice” is the phenomenon that occurs when people settle for satisfactory solutions to problems rather than seeking optimal ones in a variety of domains. Given a constant stimulus, respondent burden increases with the motivational and cognitive demands of the survey. With an increase in cognitive demands, a respondent is likely to make less effort, evident by less variation in her or his ratings. When respondents grow impatient, fatigued, or disinterested, the cognitive demands may make them particularly susceptible to “satisficing.” Additionally, a respondent is most likely to satisfice when the costs of optimizing are high, particularly with difficult or demanding questions than with easier questions.

 

KNOW THE TYPES OF MEASUREMENT INSTRUMENTS

Basic Types of Measurement Instruments:

Rating scales, summated scales, modified scales

 

 

 

Rating Scales

Rating of individuals, objects or events on various traits or characteristics at a point on a continuum or a position in an ordered set of response categories. Numerical values are assigned to each category.

 

Selection of a Standardized Instrument

§         3 Considerations in Selecting a Measurement Instrument: measurement need, finding instruments capable of measuring the variables, and assessing alternatives instruments

 

·                    Determining Measurement Need

A)                Why will the measurement occur?

§               Research

§               Assessment/diagnosis

§               Evaluation

B)        What will be measured?

§               Specify________

C)        Who is appropriate for making the most direct observations?

§               Research participant/client

§               Practitioner or researchers

§               Relevant other

D)        Which type of format is acceptable?

§               Inventories and surveys

§               Indexes

§               Scales

§               Checklists and rating systems

E)         Where will the measurement occur?

§               General setting

§               Situation-specific environment

F)         When will the measurement occur?

§               Random

§               Posttest only

§               Repeated over time