TutorChase logo
Login
AQA A-Level Psychology Notes

7.3.2 Primary and secondary data

AQA Syllabus focus:

'Primary and secondary data, including meta-analysis as a form of secondary data.'

Psychologists can gather data themselves or use data that already exists. Understanding this distinction is important because it affects cost, control, relevance, and the kinds of psychological questions a researcher can realistically investigate.

Understanding data sources

A central decision in research is whether evidence will be produced by the current investigation or taken from material that already exists. This choice shapes how specific, current, and manageable the data will be.

Primary data

Psychologists often use primary data when they want evidence collected specifically for their own research question.

Primary data collected directly by the researcher for the current investigation.

Because primary data is gathered first-hand, the researcher can decide exactly what information is needed, who will provide it, and how it will be recorded. This means the data can be closely matched to the purpose of the study rather than inherited from someone else’s design.

In psychology, primary data might include participants’ responses, scores, ratings, or observed behavior collected for the present study. The key point is that the researcher is producing new data rather than reusing existing material.

One major strength of primary data is relevance. Since it is collected for a particular investigation, it should fit the research question well. Researchers also have more control over the quality of data collection, such as making sure instructions are clear, measures are suitable, and records are complete.

However, primary data can be time-consuming and expensive to obtain. Recruiting participants, collecting responses, and organizing data all require resources. In some areas of psychology, it may also be difficult or unethical to gather the desired information directly, especially if the topic is sensitive or involves unusual populations.

Secondary data

Researchers may instead rely on secondary data, using information that already exists before the current investigation begins.

Secondary data that has already been collected by another researcher or for another purpose and is then used in a new investigation.

Secondary data in psychology can come from sources such as published journal articles, archived datasets, official statistics, institutional records, or reports produced by organizations. The current researcher does not generate the original data but analyzes or interprets it for a new purpose.

A major strength of secondary data is efficiency. It is usually quicker and cheaper to access than collecting primary data from scratch. It can also give researchers access to large amounts of information, including data collected over long time periods or from groups that would be hard to reach directly.

Secondary data may be especially useful when psychologists want to study trends, historical changes, or issues where direct collection would be impractical. For example, existing records may provide information that no individual researcher could realistically gather alone.

The main limitation is reduced control. The researcher cannot go back and change how the data was originally collected. This means the measures, sample, or procedures may not match the new research question very well. Secondary data can also be incomplete, outdated, or missing important contextual details about how it was produced.

Comparing primary and secondary data

The distinction between the two types of data can be understood through a few key contrasts:

  • Origin: primary data is first-hand; secondary data already exists.

  • Purpose: primary data is collected for the current study; secondary data was originally gathered for a different purpose or earlier investigation.

  • Control: primary data gives greater control over what is collected; secondary data gives less control.

  • Resources: primary data usually takes more time and money; secondary data is often more economical.

  • Fit to the question: primary data may be more precisely targeted; secondary data may be broader but less tailored.

Neither type is automatically better. The best choice depends on the research aim, the availability of existing material, and whether fresh evidence is necessary.

Meta-analysis as secondary data

A particularly important form of meta-analysis brings together findings from many existing studies to examine the overall pattern of evidence.

Pasted image

PRISMA flow diagram summarising the study-selection process in a systematic review and meta-analysis (records identified, screened, assessed for eligibility, and included). It makes transparent how secondary sources are filtered before results are combined. Source

Meta-analysis a method that combines the results of a number of previous studies on the same topic.

A meta-analysis is a form of secondary data because the researcher is not collecting new participant responses. Instead, they use results that have already been produced by earlier studies. The data being analyzed is therefore pre-existing.

In psychology, meta-analysis is valuable because individual studies do not always agree. One study may find a strong effect, another a weak effect, and another no effect at all. By combining many studies, a meta-analysis can give a clearer picture of the overall evidence than a single study alone.

Pasted image

Forest plot showing effect sizes from multiple individual studies (squares with horizontal confidence-interval lines) and the pooled overall estimate (diamond). It visually summarises whether results cluster around ‘no effect’ and how much each study contributes to the combined conclusion. Source

Meta-analysis can also increase confidence in findings because it considers a wider body of research. This is especially helpful when individual studies have small samples or limited scope. Looking across many studies may reveal whether a psychological effect appears consistently.

However, the quality of a meta-analysis depends on the quality of the studies it includes. If the original research is weak, biased, or inconsistent, the combined result may still be misleading. Researchers must therefore be careful about which studies are included and how fairly the evidence is combined.

Choosing between data sources

When deciding whether to use primary or secondary data, psychologists should think about:

  • whether existing data actually answers the research question

  • whether the source is current and trustworthy

  • whether enough detail is available about how the data was gathered

  • whether collecting new data is realistic in terms of time, cost, and access

  • whether the study would benefit from broad existing evidence, such as in a meta-analysis

In practice, psychological research often begins by examining secondary data and then identifies whether new primary data is still needed.

Practice Questions

Outline what is meant by secondary data in psychological research. (2 marks)

  • 1 mark for stating that the data already exists or has already been collected.

  • 1 mark for stating that it was collected by another researcher or for a different/original purpose and is then used in a new investigation.

Discuss one strength and one limitation of using meta-analysis as a form of secondary data in psychology. (6 marks)

  • 1 mark for identifying a relevant strength of meta-analysis.

  • 1 mark for explaining that strength in relation to psychological research.

  • 1 mark for developing the explanation, for example by referring to combining many studies or identifying an overall pattern of findings.

  • 1 mark for identifying a relevant limitation of meta-analysis.

  • 1 mark for explaining that limitation in relation to psychological research.

  • 1 mark for developing the explanation, for example by referring to the quality of included studies or possible bias in available published research.

FAQ

Yes.

If a psychologist collects the data directly for their own study, it is primary data for them. If another psychologist later uses that same dataset for a different analysis, it becomes secondary data for the second researcher.

So the label depends on the researcher’s relationship to the data, not on the data itself.

Raw secondary data is the original dataset, such as individual participant scores or archived records.

Summarized secondary data is data that has already been processed, such as averages, percentages, or conclusions reported in a journal article.

Raw data usually gives more flexibility because the new researcher can reanalyze it. Summarized data is quicker to use but limits what can be checked or recalculated.

If a meta-analysis only includes published studies, it may miss research that found weak or non-significant results.

This matters because studies with significant findings are often more likely to be published. As a result, the overall pattern can look stronger than it really is.

Including unpublished studies can give a more balanced picture of the evidence.

They should check:

  • who collected the data

  • why it was collected

  • how recent it is

  • whether the method is clearly described

  • whether the source is reputable

  • whether there are missing data or obvious gaps

A trustworthy source should be transparent about where the data came from and how it was produced.

Secondary data is very useful when a researcher wants to examine change over time.

For example, if a psychologist wants to study trends across many years, collecting that amount of new data would be slow or impossible. Existing records or archived findings may already contain the historical information needed.

This makes secondary data especially helpful for questions about development, social change, or long-term patterns in mental health.

Hire a tutor

Please fill out the form and we'll find a tutor for you.

1/2
Your details
Alternatively contact us via
WhatsApp, Phone Call, or Email