Sampling bias and how to avoid it

Learn more about sampling bias and why it’s a common problem. Avoid it completely by using SurveyMonkey Audience.

Sampling bias is a known issue. It occurs in studies performed by those new to research as well as seasoned researchers. It is important to understand what sampling bias is and how it happens in order to avoid it in your research efforts. Today, we’ll explain what sampling bias is and help you prevent it in your market research to ensure honest, accurate results.

Sampling bias is a type of survey bias that occurs when a research study does not use a representative sample of a target population. In other words, you gather data from a group in which some members of the intended population have a higher or lower sampling probability than others. 

This unbalanced sample can affect the validity of the research data and results, and it can also limit the extent to which conclusions can be generalized to a larger population.

Sampling bias usually happens unintentionally and is commonly caused by using convenience or purposive sampling strategies.

There are two common causes of sampling bias:

  1. Poor methodology: the most accurate sampling method is simple random sampling, in which respondents are drawn completely at random from the target population. When researchers set additional selection criteria, they risk unintentionally introducing their own selection bias into the process of choosing respondents.
  2. Poor execution: this occurs when the researcher has set forth an accurate methodology, but those implementing the sampling cut corners. If they resort to convenience sampling or don’t follow up with those participants who do not respond, they abandon the carefully formulated methodology and risk invalid results due to sampling bias.

A famous example of sampling bias occurred in the 1948 US presidential election. A telephone survey was conducted during the race, and the results implied a landslide win for Thomas E. Dewey over Harry S. Truman. The researchers did not take into account that telephone ownership was far from universal at the time and that households with telephones tended to be wealthier. They made no effort to survey lower-middle-class and lower-income citizens, who were more likely to vote for Truman.

Because the sample was not representative of the entire US population, the results were inaccurate and failed to predict the winner of the 1948 presidential race (Truman). The Chicago Tribune nevertheless trusted the survey results for its early edition and ran the incorrect front-page headline the next morning, “Dewey Defeats Truman.” This was an embarrassing lesson for the Tribune to learn about sampling bias.

There are several types of sampling bias. Let’s look at some of the most common types:

Undercoverage bias, also called exclusion bias, occurs when a portion of the population of interest is not accurately represented in the sample. This was the case in our presidential election example above: US citizens who did not own telephones were excluded from the sample. In today’s world, a similar scenario could take place if a national internet survey were conducted and researchers did not find a way to include the elderly and those with limited or no internet access.

Undercoverage can also occur when convenience sampling is used. Convenience sampling relies on participants who are easy to reach. For example, you may have seen people conducting surveys in high-traffic areas in a large city. Those surveys will probably undercover people who don’t live in the city or who drive instead of walking.
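To see how undercoverage distorts an estimate, here is a minimal sketch with made-up numbers. The group sizes and supporter counts are hypothetical and are not historical data. It compares the true level of support in a population with the level measured when only one subgroup can be reached.

```python
# Hypothetical population split by telephone ownership (illustrative numbers only).
population = {
    "phone_owners": {"size": 400, "supporters": 140},
    "non_owners": {"size": 600, "supporters": 360},
}

def support_share(groups):
    """Return the share of supporters across the given groups."""
    total = sum(g["size"] for g in groups)
    supporters = sum(g["supporters"] for g in groups)
    return supporters / total

# Everyone in the population is counted.
true_share = support_share(list(population.values()))

# A phone-only survey reaches just one subgroup, so non-owners are undercovered.
phone_only_share = support_share([population["phone_owners"]])

print(f"True support: {true_share:.0%}")
print(f"Phone-only estimate: {phone_only_share:.0%}")
```

In this made-up scenario the phone-only estimate understates support by 15 percentage points, even though every individual answer was recorded correctly.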

Self-selection bias takes place when respondents with specific characteristics are more willing to take part in research. In this case, participants volunteer to participate in the study. People who volunteer are more likely to have an opinion on the topic being studied. Conversely, some people will not volunteer because they prefer not to discuss the topic. This leaves the sample with an abundance of people with strong opinions and too few people who don’t have strong feelings or don’t wish to discuss the topic at hand.

An example of self-selection bias would be a product evaluation survey where participants can choose to participate. Those who have had a strong emotional experience, either positive or negative, are more likely to enter into the study. This skews data by excluding a full range of customer experiences. You can see this bias in effect in customer reviews.

In survivorship bias, the sample is focused on those who pass the selection criteria. Those who do not pass are ignored and are therefore underrepresented.

For example, if your survey only includes current customers, their feedback is more likely to be skewed positive than if you included those who have stopped shopping with you. They have chosen to continue a relationship with your brand, so they are likely feeling positive about their experiences. Customers who no longer purchase your products will have different insights that should be included in your survey for accuracy.

Non-response bias, or participation bias, occurs when a group of respondents refuses to participate in a study or drops out during the study period. This could be due to the length of the survey, the structure of the questions, or the sensitivity of the topics covered.

Frequently, non-response bias occurs because people do not feel comfortable providing information regarding income, gender identity, age, marital status, and other personal details. Other reasons for non-response issues include lack of interest, lack of time, or simply not wanting to share their feelings about the topic.

An example of non-response bias could be a study into drug use. Questions about how frequently certain drugs are used or what drugs are used most often may cause participants to drop out if they are embarrassed to talk about the subject or are afraid that they will be exposed as engaging in illegal drug use. 

Memory is imperfect, and when your survey participants can’t remember correctly, it results in recall bias. You may be able to reduce recall bias by collecting responses soon after the occurrence you are studying. However, in many cases, you simply can’t do anything to mitigate recall bias. Certain respondents may not recall certain experiences as well as others. 

For example, if you are studying risk factors for a specific kind of disease, people who have had the disease are more likely to recall relevant experiences, and to make more effort to remember them, than respondents who have been unaffected by the disease.

When researchers consciously or subconsciously influence the interpretation of the data, it results in observer bias. It may take the form of focusing only on a certain dataset or influencing participants during data collection. 

As an example, sometimes researchers are present during participants' interviews. If a researcher inadvertently displays enthusiasm for a certain type of response, the participants may notice and change their responses to please the researcher. In another example, a researcher makes unintentionally erroneous interpretations of data so that the study results fit their hypothesis or expectation. 

Exclusion bias is the result of intentionally excluding specific subgroups from your study. This affects the validity of the study.

For example, a study might exclude a group that has recently moved into the study area. This can lead to false connections between research variables, which affects your research outcomes.

Healthy user bias is most often observed in medical studies. It involves a disproportionate focus on participants who are more active, healthy, and fit than most of the general population. People who are not healthy enough to participate are omitted.

For example, in a drug trial for a cholesterol-lowering medication, the effectiveness may be misrepresented due to factors other than the effects of the medication (the health of the individuals in the study), making the medication appear to be more beneficial than it really is.

Berkson’s Fallacy is the opposite of healthy user bias. In it, researchers only study participants who are very ill, causing an under-representation of healthy people. This results in a false finding of a correlation between variables. 

For example, in 1946, Joseph Berkson studied his patients in the hospital and found an apparent association between diabetes and gallbladder disease. Even though the diseases were independent, the hospital data made them appear related. Berkson concluded that the correlation was misleading because people who are hospitalized are more likely to have several diseases at once. Information was collected only from inpatients, so the data incorrectly suggested a link between the two diseases.

To avoid sampling bias, you need to look carefully at your survey methodology and design. Clearly define your survey goals and define your target audience. Ensure that your process allows an equal opportunity for each member of the target population to be part of your sample group. 

To reassure your participants, always include a statement at the beginning of your survey explaining that their answers will be anonymous and used only for the purposes of your study.

Here’s an example statement: This survey is anonymous. No one will be able to identify you or your answers, and no one will know whether or not you participated in the study. 

Let’s look at some additional ways to avoid survey sampling bias:

Clearly define the groups in your target study population, and then make sure sufficient data is collected from each group. Provide training to those conducting your study to prevent them from resorting to convenience sampling.
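One way to check that each group is sufficiently represented is to count responses per group against a minimum target. The sketch below is only an illustration: the group labels, the `underfilled_groups` helper, and the minimum of three responses per group are all hypothetical, so substitute your own definitions.

```python
from collections import Counter

def underfilled_groups(responses, groups, minimum):
    """Return each group whose response count is below the minimum, with its shortfall."""
    counts = Counter(r["group"] for r in responses)
    return {g: minimum - counts[g] for g in groups if counts[g] < minimum}

# Hypothetical responses, each tagged with the respondent's group.
responses = [
    {"group": "18-34"}, {"group": "18-34"}, {"group": "18-34"},
    {"group": "35-54"}, {"group": "35-54"},
    {"group": "55+"},
]

shortfalls = underfilled_groups(responses, ["18-34", "35-54", "55+"], minimum=3)
print(shortfalls)  # groups that still need more responses
```

Running a check like this during data collection shows which groups still need outreach before the study closes.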

A simple way to avoid convenience sampling is to use SurveyMonkey Audience to reach your target population. You can pick your target audience, send your survey, collect feedback, and analyze your results.

Why are people not responding to your survey? Follow up with non-responders to find out if you’re asking the wrong questions, requesting the wrong information, or targeting the wrong audience. Use the follow-up information to gain actionable insights for your next study.

Create a survey that is brief and easy to understand. Survey studies with complicated queries or too many questions lead to lower survey completion rates.

Clearly define your target audience, parameters for sample selection, and the sampling frame of your study to ensure that relevant, accurate data can be collected.

Establish what you want to accomplish with your survey first. With that in mind, you can determine what sample methodology and survey structure will work best. You’ll have a better understanding of who should participate, the necessary sample size, and how to communicate with your target respondents.

Two sampling methods are well suited to keeping sampling bias out of your study: simple random sampling and stratified random sampling.

Simple random sampling

In this sampling method, participants are chosen completely by chance. Every member of the target population has equal odds of being selected for the study. This is easily accomplished in an Excel spreadsheet by adding the formula “=RAND()” to every row of your master list of participants. This will produce a random decimal value for each participant. Sort the list by that value, and then select any contiguous group in the list (e.g. the top 100 or the bottom 100). This method is particularly useful in large studies.
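The same idea as the spreadsheet approach can be sketched in Python using the standard library's `random.sample`, which draws without replacement so that every member has an equal chance of selection. The `simple_random_sample` helper and the list of participant IDs are hypothetical and used only for illustration.

```python
import random

def simple_random_sample(master_list, sample_size, seed=None):
    """Draw sample_size participants so that every member has an equal chance."""
    # The seed is optional; set it only when you need a reproducible draw.
    rng = random.Random(seed)
    return rng.sample(master_list, sample_size)

# Hypothetical master list of 1,000 participant IDs.
master_list = [f"participant_{i}" for i in range(1, 1001)]

sample = simple_random_sample(master_list, 100, seed=42)
print(len(sample), "participants selected, all distinct:", len(set(sample)) == 100)
```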

Stratified random sampling

In stratified random sampling, researchers divide the population they are studying into subgroups, or strata, and then sample randomly from each subgroup so that the sample mirrors the population. For example, suppose 1,000 people are in the target population of real estate agents, and 10 individuals are required for the study. There are 500 female agents and 500 male agents in the population, so the researcher should randomly select five female and five male agents for the study.
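The real estate agent example can be sketched as follows. This is a minimal illustration that assumes proportional allocation (each stratum gets a share of the sample equal to its share of the population). The `stratified_sample` helper and the agent records are hypothetical.

```python
import random

def stratified_sample(population, stratum_of, sample_size, seed=None):
    """Randomly sample from each stratum in proportion to its share of the population."""
    rng = random.Random(seed)

    # Group the population by stratum.
    strata = {}
    for person in population:
        strata.setdefault(stratum_of(person), []).append(person)

    sample = []
    for members in strata.values():
        # Proportional allocation; with uneven strata, rounding may need adjusting
        # so that the stratum sizes still add up to sample_size.
        k = round(sample_size * len(members) / len(population))
        sample.extend(rng.sample(members, k))
    return sample

# Hypothetical population: 500 female and 500 male agents.
agents = [{"id": i, "gender": "female" if i < 500 else "male"} for i in range(1000)]

sample = stratified_sample(agents, lambda a: a["gender"], 10, seed=7)
print(sorted(a["gender"] for a in sample))
```

With an evenly split population of 1,000 and a sample of 10, each stratum contributes exactly five agents, and who those five are is left to chance.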

Both of these methods are effective in reducing the chances of sampling bias.

The first step in avoiding sampling bias is understanding what it is, what causes it, and which types exist. Armed with this information, you can use our tips for avoiding sampling bias and keep your study results valid and accurate.

Remember, even the most experienced research professionals can inadvertently introduce sampling bias. Refer back to this article and double-check your methodology to keep bias out of your study.

The best way to remove the chance of sampling bias is to use SurveyMonkey Audience. You’ll receive responses from your ideal audience for high-quality, bias-free data. Find out more about our survey response tool today!
