>
>
>
How to analyse research data as a high school student
How to analyse research data as a high school student
How to analyse research data as a high school student | RISE Research
How to analyse research data as a high school student | RISE Research
RISE Research
RISE Research

TL;DR: Data analysis is the process of examining, organising, and interpreting the information you collected during your research study. For high school students, it is the step that turns raw numbers or interview responses into findings that actually answer your research question. Done well, it strengthens your paper, your publication chances, and your university application narrative. This post gives you a concrete, step-by-step process for how to analyse research data as a high school student, including the tools, common mistakes, and the points where most students working alone get stuck.
Introduction
Most high school students think data analysis means running numbers through a spreadsheet and reporting what comes out. It does not. Data analysis is the process of deciding which patterns in your data are meaningful, which are noise, and how to present your findings in a way that holds up to academic scrutiny. Knowing how to analyse research data as a high school student requires more than arithmetic. It requires judgment about what your data can and cannot support.
The gap between what students think analysis involves and what it actually involves is where most research projects stall. A student might collect 80 survey responses, calculate averages, and assume the work is done. But without choosing the right statistical test, checking for confounding variables, or connecting findings back to the original research question, the analysis section of a paper will not survive peer review. This post walks you through exactly how to do it correctly, step by step.
What is data analysis and why does it matter for your research paper?
Answer Capsule: Data analysis is the structured process of cleaning, organising, and interpreting collected data to answer a specific research question. For high school researchers, it is the step that produces publishable findings. Without rigorous analysis, a research paper has no results section, no conclusions, and no academic contribution.
Data analysis sits between data collection and the writing of your results and discussion sections. It is not optional and it cannot be improvised. A research paper without proper analysis is a collection of observations with no argument. Journals reject papers at this stage more than at any other, because weak analysis means the conclusions cannot be trusted.
For high school students aiming at publication, the stakes are specific. If you are submitting to a peer-reviewed journal, reviewers will assess whether your chosen method matches your research question, whether your sample size justifies your conclusions, and whether you have accounted for limitations. If you are using your research as part of a university application, the analysis section is where admissions readers see whether you understand your own work or simply collected data and handed it to a teacher to interpret.
Getting this step right separates a research project that earns recognition from one that stays in a drawer. Scholars in the RISE Research publications archive consistently produce analysis sections that meet journal standards, because they approach this stage with a clear method from the beginning.
How to analyse research data as a high school student: a step-by-step process
Step 1: Return to your research question before touching your data. Before opening a spreadsheet or reading a single interview transcript, write your research question at the top of a blank document. Every decision you make during analysis should be tested against it. Students who skip this step end up analysing data that does not answer their question, or worse, changing their question to match whatever their data happens to show. That is called HARKing (Hypothesising After Results are Known) and it is a form of research misconduct.
Step 2: Clean your data. Raw data is almost never ready to analyse. Survey responses will have missing entries, interview transcripts will have irrelevant sections, and experimental records will have outliers that need to be examined and documented. Cleaning means removing duplicate entries, deciding how to handle missing data (and recording that decision), and checking that every data point was recorded correctly. In a spreadsheet tool like Google Sheets or Microsoft Excel, this means reviewing every column for inconsistencies before running any calculations. Document every cleaning decision in a methods log so you can report it accurately in your paper.
Step 3: Choose the right analysis method for your data type. This is the most consequential decision in the entire analysis process. Quantitative data, meaning numbers, requires statistical methods. The simplest include descriptive statistics such as mean, median, and standard deviation, which summarise what your data looks like. If you are comparing two groups, a t-test may be appropriate. If you are looking at relationships between variables, a correlation or regression analysis may apply. Qualitative data, meaning interview responses, open-ended survey answers, or observations, requires thematic analysis: reading through responses, identifying recurring ideas, and coding them into categories. A common mistake is applying quantitative methods to qualitative data or vice versa. The method must match the data type and the research question, not whichever approach feels more familiar.
Step 4: Run your analysis and record every step. Use a tool that produces reproducible results. For quantitative analysis, Google Sheets handles basic statistics; JASP is a free, open-source tool that handles more advanced tests including t-tests, ANOVA, and correlation, with a clear interface designed for students. For qualitative analysis, a tool like MAXQDA (free trial available) or simply a structured coding spreadsheet works well. Record every step you take. If you run a t-test, note which variables you used, what the output was, and what the p-value means in plain language. Reproducibility is a core standard of academic research, and reviewers will expect to see your process described clearly in your methods section.
Step 5: Interpret your findings in relation to your research question. Numbers and coded themes do not interpret themselves. Once your analysis is complete, write one paragraph answering this question: what do these results actually say about my research question? Then write a second paragraph: what do they not say, and what are the limitations? A finding that your survey showed a correlation between two variables does not prove causation. A qualitative theme that appeared in six out of ten interviews is not universal. Precision in interpretation is what separates publishable work from a school project.
Step 6: Connect your findings to the existing literature. Your analysis does not exist in isolation. In your discussion section, you will need to explain whether your findings support, contradict, or extend what previous researchers found. This connection should be visible in how you frame your results. If your data shows a pattern that matches a study you cited in your literature review, name it. If your results differ, explain why they might. This is the step that demonstrates genuine scholarly thinking and it is the step most high school students omit entirely.
The single most common mistake at this stage is reporting results without interpreting them. A table of numbers is not a finding. A finding is a specific, evidence-backed answer to a specific question. Every result you report must be followed by a sentence explaining what it means.
Where most high school students get stuck with data analysis
The first sticking point is choosing the wrong statistical test. Most high school students have studied basic statistics in mathematics class, but academic research uses a wider range of tests, each with specific conditions that must be met before the test is valid. Applying a t-test to data that does not meet the assumption of normal distribution, for example, produces results that reviewers will immediately flag. Without guidance, most students do not know these conditions exist.
The second sticking point is interpreting qualitative data without a systematic coding framework. Reading through interview transcripts and noting what seems interesting is not thematic analysis. Thematic analysis requires a defined coding process, inter-rater reliability checks where possible, and a clear audit trail from raw data to reported theme. Students working alone almost always skip the audit trail, which makes their qualitative findings impossible to verify.
The third sticking point is scope. High school students frequently attempt analysis that their sample size cannot support. Drawing conclusions about a population from 15 survey responses is not statistically defensible, and a PhD mentor will redirect this before the paper is written, not after a journal rejection.
A PhD mentor intervenes at each of these points with specific, experience-based guidance. They know which test fits which question. They have run thematic analysis on dozens of datasets. And they will tell a student directly when a conclusion is overreaching, which is information a student working alone has no way to access.
If you are at this stage and want a PhD mentor to guide you through data analysis and the full research process, book a free 20-minute Research Assessment to see what is possible before the Summer 2026 Priority Deadline.
What does good data analysis look like? A high school example
Answer Capsule: Strong data analysis in a high school research paper states the method chosen and why, reports results with precise values, and interprets findings in direct relation to the research question. Weak analysis reports averages without context, skips the rationale for method choice, and draws conclusions the data cannot support.
Consider a student researching the relationship between sleep duration and academic performance in Grade 11 students.
Weak example: "Most students who slept more did better on tests. The average sleep was 6.8 hours. This shows that sleep is important for grades."
Strong example: "A Pearson correlation analysis (r = 0.61, p = 0.003, n = 45) indicated a moderate positive correlation between nightly sleep duration and standardised test scores among Grade 11 participants. This finding aligns with Zhang et al. (2020), who reported a similar correlation (r = 0.58) in a comparable adolescent sample. The correlation does not establish causation; students with higher academic motivation may manage both sleep and study time more effectively, which represents a confounding variable not controlled for in this study."
The strong example names the specific test used, reports the exact statistical values, acknowledges the sample size, connects the finding to prior literature, and identifies a limitation. It does not overstate what the data shows. Every one of those elements is expected in a peer-reviewed submission and each one is absent from the weak example. The difference is not intelligence. It is knowing the standard and being trained to meet it. Students who have worked through the RISE Research project process produce analysis sections that meet this standard consistently.
The best tools for data analysis as a high school student
JASP is a free, open-source statistical software package developed by the University of Amsterdam. It handles t-tests, ANOVA, correlation, regression, and Bayesian analysis through a clean drag-and-drop interface. It is the most accessible tool for high school students doing quantitative analysis and produces output that can be cited in a paper. The limitation is that it requires a basic understanding of which test to select before you open it.
Google Sheets is sufficient for descriptive statistics including mean, median, standard deviation, and basic charts. It is free, cloud-based, and shareable with a mentor for real-time review. Its limitation is that it does not support more advanced inferential tests, so it works best for exploratory analysis or simple comparisons.
MAXQDA offers a free trial and is one of the most widely used qualitative analysis tools in academic research. It allows students to import interview transcripts, apply codes, and visualise the frequency and relationship between themes. The learning curve is moderate but the output is structured and defensible in a paper.
Google Scholar is not an analysis tool, but it is essential during the interpretation step. When you are connecting your findings to existing literature, Google Scholar allows you to search for papers that reported similar results, check how other researchers interpreted comparable data, and verify that your conclusions are grounded in the field. It is free and indexes most major academic journals.
Zotero is a free reference manager that becomes critical during the analysis and write-up stages when you are citing the studies your findings relate to. It integrates with Google Docs and Microsoft Word and formats citations automatically in APA, MLA, Chicago, and other styles. Students who do not use a reference manager at this stage typically produce citation errors that reviewers flag immediately. For students aiming at publication in journals listed in the Journal of Student Research guide, accurate citations are non-negotiable.
Frequently asked questions about data analysis for high school students
How do I know which statistical test to use for my research?
The choice of statistical test depends on three factors: your research question, the type of data you collected, and the number of groups you are comparing. If you are comparing two groups on a continuous variable, a t-test is typically appropriate. If you are looking at a relationship between two continuous variables, use Pearson correlation. If your data is non-normal or ordinal, use non-parametric equivalents such as the Mann-Whitney U test. JASP includes a built-in guide that walks you through test selection based on your data type.
Can I analyse research data without knowing advanced statistics?
Yes, but only up to a point. Descriptive statistics including mean, median, and standard deviation require only basic maths and are sufficient for some research questions. However, most peer-reviewed journals expect inferential statistics that test whether your findings are statistically significant. Learning to run and interpret a t-test or correlation in JASP is achievable for a motivated high school student, particularly with mentor guidance. Attempting advanced multivariate analysis without training typically produces errors that reviewers catch immediately.
How many survey responses do I need for valid data analysis?
The minimum sample size depends on the statistical test you plan to use and the effect size you expect to detect. As a general rule, fewer than 30 responses make most inferential statistics unreliable. For a simple correlation or t-test, 30 to 50 responses allow for basic analysis with appropriate caveats. For more complex designs, sample size calculators such as G*Power (free) can determine the minimum needed before you begin data collection, which is the correct time to make this decision.
What is thematic analysis and how do I do it for qualitative data?
Thematic analysis is a method for identifying, analysing, and reporting patterns within qualitative data such as interview transcripts or open-ended survey responses. The process involves reading through your data multiple times, assigning codes to meaningful segments, grouping codes into broader themes, and reviewing whether those themes accurately represent the dataset. Braun and Clarke's six-phase framework is the most widely cited approach and is accessible to high school students. MAXQDA supports this process with digital coding tools.
How do I report data analysis results in a research paper?
Results should be reported in a dedicated results section that states what the analysis found without interpretation. For quantitative data, report the test used, the test statistic, the degrees of freedom, the p-value, and the effect size. For example: t(43) = 2.31, p = 0.025, d = 0.68. For qualitative data, present each theme with supporting quotes from your data. Interpretation of what the results mean belongs in the discussion section, not the results section. Mixing these two sections is one of the most common structural errors in high school research papers.
Conclusion
Data analysis is the step that determines whether your research produces findings or just produces data. The three things that matter most are choosing the right method for your research question, reporting results with precision and appropriate statistical values, and interpreting findings honestly within the limits of what your data can support. These are learnable skills. They are also skills that take time to develop without feedback from someone who has done them before.
Knowing how to analyse research data as a high school student is not just about completing a project. It is about producing work that holds up in a peer-reviewed journal and that demonstrates genuine scholarly capability to university admissions readers. The outcomes achieved by RISE Research scholars reflect what becomes possible when students approach this stage with structured expert support. The Summer 2026 Priority Deadline is approaching. If data analysis is a step you want to get right with expert guidance behind you, schedule a free Research Assessment and we will match you with a PhD mentor who has done this in your subject area.
TL;DR: Data analysis is the process of examining, organising, and interpreting the information you collected during your research study. For high school students, it is the step that turns raw numbers or interview responses into findings that actually answer your research question. Done well, it strengthens your paper, your publication chances, and your university application narrative. This post gives you a concrete, step-by-step process for how to analyse research data as a high school student, including the tools, common mistakes, and the points where most students working alone get stuck.
Introduction
Most high school students think data analysis means running numbers through a spreadsheet and reporting what comes out. It does not. Data analysis is the process of deciding which patterns in your data are meaningful, which are noise, and how to present your findings in a way that holds up to academic scrutiny. Knowing how to analyse research data as a high school student requires more than arithmetic. It requires judgment about what your data can and cannot support.
The gap between what students think analysis involves and what it actually involves is where most research projects stall. A student might collect 80 survey responses, calculate averages, and assume the work is done. But without choosing the right statistical test, checking for confounding variables, or connecting findings back to the original research question, the analysis section of a paper will not survive peer review. This post walks you through exactly how to do it correctly, step by step.
What is data analysis and why does it matter for your research paper?
Answer Capsule: Data analysis is the structured process of cleaning, organising, and interpreting collected data to answer a specific research question. For high school researchers, it is the step that produces publishable findings. Without rigorous analysis, a research paper has no results section, no conclusions, and no academic contribution.
Data analysis sits between data collection and the writing of your results and discussion sections. It is not optional and it cannot be improvised. A research paper without proper analysis is a collection of observations with no argument. Journals reject papers at this stage more than at any other, because weak analysis means the conclusions cannot be trusted.
For high school students aiming at publication, the stakes are specific. If you are submitting to a peer-reviewed journal, reviewers will assess whether your chosen method matches your research question, whether your sample size justifies your conclusions, and whether you have accounted for limitations. If you are using your research as part of a university application, the analysis section is where admissions readers see whether you understand your own work or simply collected data and handed it to a teacher to interpret.
Getting this step right separates a research project that earns recognition from one that stays in a drawer. Scholars in the RISE Research publications archive consistently produce analysis sections that meet journal standards, because they approach this stage with a clear method from the beginning.
How to analyse research data as a high school student: a step-by-step process
Step 1: Return to your research question before touching your data. Before opening a spreadsheet or reading a single interview transcript, write your research question at the top of a blank document. Every decision you make during analysis should be tested against it. Students who skip this step end up analysing data that does not answer their question, or worse, changing their question to match whatever their data happens to show. That is called HARKing (Hypothesising After Results are Known) and it is a form of research misconduct.
Step 2: Clean your data. Raw data is almost never ready to analyse. Survey responses will have missing entries, interview transcripts will have irrelevant sections, and experimental records will have outliers that need to be examined and documented. Cleaning means removing duplicate entries, deciding how to handle missing data (and recording that decision), and checking that every data point was recorded correctly. In a spreadsheet tool like Google Sheets or Microsoft Excel, this means reviewing every column for inconsistencies before running any calculations. Document every cleaning decision in a methods log so you can report it accurately in your paper.
Step 3: Choose the right analysis method for your data type. This is the most consequential decision in the entire analysis process. Quantitative data, meaning numbers, requires statistical methods. The simplest include descriptive statistics such as mean, median, and standard deviation, which summarise what your data looks like. If you are comparing two groups, a t-test may be appropriate. If you are looking at relationships between variables, a correlation or regression analysis may apply. Qualitative data, meaning interview responses, open-ended survey answers, or observations, requires thematic analysis: reading through responses, identifying recurring ideas, and coding them into categories. A common mistake is applying quantitative methods to qualitative data or vice versa. The method must match the data type and the research question, not whichever approach feels more familiar.
Step 4: Run your analysis and record every step. Use a tool that produces reproducible results. For quantitative analysis, Google Sheets handles basic statistics; JASP is a free, open-source tool that handles more advanced tests including t-tests, ANOVA, and correlation, with a clear interface designed for students. For qualitative analysis, a tool like MAXQDA (free trial available) or simply a structured coding spreadsheet works well. Record every step you take. If you run a t-test, note which variables you used, what the output was, and what the p-value means in plain language. Reproducibility is a core standard of academic research, and reviewers will expect to see your process described clearly in your methods section.
Step 5: Interpret your findings in relation to your research question. Numbers and coded themes do not interpret themselves. Once your analysis is complete, write one paragraph answering this question: what do these results actually say about my research question? Then write a second paragraph: what do they not say, and what are the limitations? A finding that your survey showed a correlation between two variables does not prove causation. A qualitative theme that appeared in six out of ten interviews is not universal. Precision in interpretation is what separates publishable work from a school project.
Step 6: Connect your findings to the existing literature. Your analysis does not exist in isolation. In your discussion section, you will need to explain whether your findings support, contradict, or extend what previous researchers found. This connection should be visible in how you frame your results. If your data shows a pattern that matches a study you cited in your literature review, name it. If your results differ, explain why they might. This is the step that demonstrates genuine scholarly thinking and it is the step most high school students omit entirely.
The single most common mistake at this stage is reporting results without interpreting them. A table of numbers is not a finding. A finding is a specific, evidence-backed answer to a specific question. Every result you report must be followed by a sentence explaining what it means.
Where most high school students get stuck with data analysis
The first sticking point is choosing the wrong statistical test. Most high school students have studied basic statistics in mathematics class, but academic research uses a wider range of tests, each with specific conditions that must be met before the test is valid. Applying a t-test to data that does not meet the assumption of normal distribution, for example, produces results that reviewers will immediately flag. Without guidance, most students do not know these conditions exist.
The second sticking point is interpreting qualitative data without a systematic coding framework. Reading through interview transcripts and noting what seems interesting is not thematic analysis. Thematic analysis requires a defined coding process, inter-rater reliability checks where possible, and a clear audit trail from raw data to reported theme. Students working alone almost always skip the audit trail, which makes their qualitative findings impossible to verify.
The third sticking point is scope. High school students frequently attempt analysis that their sample size cannot support. Drawing conclusions about a population from 15 survey responses is not statistically defensible, and a PhD mentor will redirect this before the paper is written, not after a journal rejection.
A PhD mentor intervenes at each of these points with specific, experience-based guidance. They know which test fits which question. They have run thematic analysis on dozens of datasets. And they will tell a student directly when a conclusion is overreaching, which is information a student working alone has no way to access.
If you are at this stage and want a PhD mentor to guide you through data analysis and the full research process, book a free 20-minute Research Assessment to see what is possible before the Summer 2026 Priority Deadline.
What does good data analysis look like? A high school example
Answer Capsule: Strong data analysis in a high school research paper states the method chosen and why, reports results with precise values, and interprets findings in direct relation to the research question. Weak analysis reports averages without context, skips the rationale for method choice, and draws conclusions the data cannot support.
Consider a student researching the relationship between sleep duration and academic performance in Grade 11 students.
Weak example: "Most students who slept more did better on tests. The average sleep was 6.8 hours. This shows that sleep is important for grades."
Strong example: "A Pearson correlation analysis (r = 0.61, p = 0.003, n = 45) indicated a moderate positive correlation between nightly sleep duration and standardised test scores among Grade 11 participants. This finding aligns with Zhang et al. (2020), who reported a similar correlation (r = 0.58) in a comparable adolescent sample. The correlation does not establish causation; students with higher academic motivation may manage both sleep and study time more effectively, which represents a confounding variable not controlled for in this study."
The strong example names the specific test used, reports the exact statistical values, acknowledges the sample size, connects the finding to prior literature, and identifies a limitation. It does not overstate what the data shows. Every one of those elements is expected in a peer-reviewed submission and each one is absent from the weak example. The difference is not intelligence. It is knowing the standard and being trained to meet it. Students who have worked through the RISE Research project process produce analysis sections that meet this standard consistently.
The best tools for data analysis as a high school student
JASP is a free, open-source statistical software package developed by the University of Amsterdam. It handles t-tests, ANOVA, correlation, regression, and Bayesian analysis through a clean drag-and-drop interface. It is the most accessible tool for high school students doing quantitative analysis and produces output that can be cited in a paper. The limitation is that it requires a basic understanding of which test to select before you open it.
Google Sheets is sufficient for descriptive statistics including mean, median, standard deviation, and basic charts. It is free, cloud-based, and shareable with a mentor for real-time review. Its limitation is that it does not support more advanced inferential tests, so it works best for exploratory analysis or simple comparisons.
MAXQDA offers a free trial and is one of the most widely used qualitative analysis tools in academic research. It allows students to import interview transcripts, apply codes, and visualise the frequency and relationship between themes. The learning curve is moderate but the output is structured and defensible in a paper.
Google Scholar is not an analysis tool, but it is essential during the interpretation step. When you are connecting your findings to existing literature, Google Scholar allows you to search for papers that reported similar results, check how other researchers interpreted comparable data, and verify that your conclusions are grounded in the field. It is free and indexes most major academic journals.
Zotero is a free reference manager that becomes critical during the analysis and write-up stages when you are citing the studies your findings relate to. It integrates with Google Docs and Microsoft Word and formats citations automatically in APA, MLA, Chicago, and other styles. Students who do not use a reference manager at this stage typically produce citation errors that reviewers flag immediately. For students aiming at publication in journals listed in the Journal of Student Research guide, accurate citations are non-negotiable.
Frequently asked questions about data analysis for high school students
How do I know which statistical test to use for my research?
The choice of statistical test depends on three factors: your research question, the type of data you collected, and the number of groups you are comparing. If you are comparing two groups on a continuous variable, a t-test is typically appropriate. If you are looking at a relationship between two continuous variables, use Pearson correlation. If your data is non-normal or ordinal, use non-parametric equivalents such as the Mann-Whitney U test. JASP includes a built-in guide that walks you through test selection based on your data type.
Can I analyse research data without knowing advanced statistics?
Yes, but only up to a point. Descriptive statistics including mean, median, and standard deviation require only basic maths and are sufficient for some research questions. However, most peer-reviewed journals expect inferential statistics that test whether your findings are statistically significant. Learning to run and interpret a t-test or correlation in JASP is achievable for a motivated high school student, particularly with mentor guidance. Attempting advanced multivariate analysis without training typically produces errors that reviewers catch immediately.
How many survey responses do I need for valid data analysis?
The minimum sample size depends on the statistical test you plan to use and the effect size you expect to detect. As a general rule, fewer than 30 responses make most inferential statistics unreliable. For a simple correlation or t-test, 30 to 50 responses allow for basic analysis with appropriate caveats. For more complex designs, sample size calculators such as G*Power (free) can determine the minimum needed before you begin data collection, which is the correct time to make this decision.
What is thematic analysis and how do I do it for qualitative data?
Thematic analysis is a method for identifying, analysing, and reporting patterns within qualitative data such as interview transcripts or open-ended survey responses. The process involves reading through your data multiple times, assigning codes to meaningful segments, grouping codes into broader themes, and reviewing whether those themes accurately represent the dataset. Braun and Clarke's six-phase framework is the most widely cited approach and is accessible to high school students. MAXQDA supports this process with digital coding tools.
How do I report data analysis results in a research paper?
Results should be reported in a dedicated results section that states what the analysis found without interpretation. For quantitative data, report the test used, the test statistic, the degrees of freedom, the p-value, and the effect size. For example: t(43) = 2.31, p = 0.025, d = 0.68. For qualitative data, present each theme with supporting quotes from your data. Interpretation of what the results mean belongs in the discussion section, not the results section. Mixing these two sections is one of the most common structural errors in high school research papers.
Conclusion
Data analysis is the step that determines whether your research produces findings or just produces data. The three things that matter most are choosing the right method for your research question, reporting results with precision and appropriate statistical values, and interpreting findings honestly within the limits of what your data can support. These are learnable skills. They are also skills that take time to develop without feedback from someone who has done them before.
Knowing how to analyse research data as a high school student is not just about completing a project. It is about producing work that holds up in a peer-reviewed journal and that demonstrates genuine scholarly capability to university admissions readers. The outcomes achieved by RISE Research scholars reflect what becomes possible when students approach this stage with structured expert support. The Summer 2026 Priority Deadline is approaching. If data analysis is a step you want to get right with expert guidance behind you, schedule a free Research Assessment and we will match you with a PhD mentor who has done this in your subject area.
Read More