Before Collecting Data
Proper record keeping, controls, ethical practices, data analysis, and AI use are fundamental to a successful research experience and will aid you in any career path.
Documenting Data
- Use our Anatomy of a Research Notebook document to start a conversation with your mentor on how you should manage the data you collect and the additional information you learn during your research experience.
- Browse the Managing Data Resources from NC State University Libraries for more information.
Designing Controls
No matter the type of research being conducted, establishing proper controls ensures the accurate interpretation of results. Visit the categories below to learn more about the range of controls used across fields.
Treatment and Control Groups
Very common in experimental research, treatment and control groups allow researchers to compare and contrast the impact of a specific condition, drug/compound, intervention, etc. aka “treatment” in question. In establishing control groups, all factors should be as equal as possible between the two groups with only one exception, the treatment (condition, drug/compound, interventions, etc. under study).
Positive/Negative Controls
Very common in quantitative research, positive and negative controls allow the researcher to be certain of two critical points.
One, that the experiment, as designed and carried out, is capable of detecting/demonstrating the result/condition/reaction, etc. under question. This comes from using a positive control; adding a factor known to produce a positive result with all other conditions being the same. When a positive control works, it allows researchers to trust that any negative results for the experiment itself are not false negatives.
Two, that the experiment, as designed and carried out, is not contaminated leading to false positives. This comes from using a negative control; running the experiment without a sample or known positive control so that if any positive results are detected/demonstrated they can only be the result of contamination. Thus, when negative controls work, researchers can trust that any positive results for the experiment itself are not false positives.
It is important to note that for any experimental design for which positive/negative controls are needed, every run of the experiment will need positive and negative controls built in otherwise any results without them are questionable.
Confirmation Experiments
Sometimes in research the best experimental designs are still only indirect measures of what researchers are testing. In this situation, it is especially important to utilize multiple, independent* methods to test for the same information. When more than one independent method indicates the same result or conclusion, researchers can be more certain of the outcomes.
*Independent methods are ones that are completely separate from one another. For example, if researchers wanted to test ice cream preferences in a population, two independent methods would be 1) using a survey to collect personal accounts and 2) analyzing the purchase records from stores that sell ice cream.
Multiple Sources
Very common in qualitative research, but also a critical part of any background literature review is examining multiple sources of information to ensure the full wealth of knowledge has been examined. When utilizing multiple sources for information, it’s important to cover a wide breadth as well as a sufficient depth.
When talking with people or observing populations (past or present), researchers should seek information from a wide range of individuals or sources. Using multiple sources (e.g. interviews, historical records, etc.) allows qualitative researchers to triangulate or cross-verify their findings.
When conducting a literature review, researchers should use a wide range of keywords to find articles applicable to your study. Researchers should also make sure to honestly consider articles discussing results that conflict with their own results or assumptions.
Fact and Member Checking
Two common controls in qualitative research, fact checking and member checking allow researchers to introduce rigor into the analysis of people’s stories. Even when opinions are shared, researchers will still have a set of facts that can be verified to confirm the trustworthiness of the source and the accuracy of the information capturing process.
For example, researchers can fact check comments on such things as: the weather; what other witnesses or participants reported; dates and times of known or cataloged occurrences; recordings from the event, day, or time period; and even other documented comments shared by the subject.
Member checking comes into play when researchers reach back out to participants to confirm the accuracy of their notes, transcripts, and their interpretation, as well as verifying the details that can and cannot be shared in places such as publications or reports.
Self Checking
Perhaps the most critical control of all, self checking is where researchers honestly consider their intentions, motivations, and preconceived notions plus the impacts each of these can have on the research in question. Qualitative researchers often make note of these during the data collection and analysis processes (commonly called reflexivity or positionality), however, researchers engaged in any type of research should make use of this control.
Blind and Double-Blind Studies
Anytime there is a possibility that a subject’s and/or data collector’s knowledge of the experiment or treatment conditions can impact the results, blind or double-blind controls should be used. These allow researchers to alleviate those concerns.
In blind controls, the subjects are deliberately made unaware as to which group they’ve been assigned and/or what the researchers are measuring. This way, the subject’s assumptions or expectations about either the treatment or control scenario are not impacting their behavior or responses.
In double-blind controls, the same blind control set up exists with the added condition that the data collector is also unaware as to which group the subject has been assigned. This way the data collectors cannot consciously or unconsciously influence either the subjects themselves or the data interpretation/analysis process.
Random Sampling and Sampling Size
In both quantitative and qualitative research a population under study can be too large to test or examine each individual. Researchers in this situation have to rely on random sampling to cover the range of natural differences within the given population, and a large enough sampling size to ensure adequate representation of the same population.
Researchers should factor statistics (power analysis), logistics, and resources when determining an appropriate sample size. Special steps should also be taken in the methods to make sure the sampling is random. See some sample questions below for consideration:
- With human subjects for example, where is the recruitment information posted?
- What language is used to draw in participants?
- With plant and animal subjects, from where were they sampled?
- What methods were used to capture or collect the samples?
- Is it possible that there are variations within the population that would preclude a subset of individuals from being picked up by the sampling methods?
- What steps could correct that imbalance?
Ethical Considerations
Plagiarism, Authorship, Collaborative Research, Conflicts of Interest, Data Management, Mentoring, Peer Review, Research Misconduct, and Research Security and Transparency are key considerations when conducting research. All student researchers are strongly encouraged to pursue some form of Responsible Conduct of Research training (which covers each of these topics) and additional ethical resources.
Before working with human subjects, their data, or with animal subjects, researchers must obtain specific approvals. The approval processes ensure the research is being conducted ethically without undue harm. Additional training will also be required.
Visit Required Training for guidance on training options and additional resources for the above ethical considerations.
Data Analysis
- Take advantage of the many resources provided by NC State University Libraries.
- Preparing Data
- Analyzing Data
- Sage’s Qualitative Data Analysis Videos
- Accessible through NC State University Libraries’ subscription to Sage. Learn how to gain off-campus access to these and other electronic resources.
- Accessing Published Data
- Geographic Information Systems (GIS)
- Visualizing and Communicating Data
- Spaces
- Data Experience Lab (DXL) in D.H. Hill Jr. Library
- Dataspace in James B. Hunt Library
- Take advantage of the courses and resources offered by the Data Science and AI Academy (DSA).
Responsible AI Use
Generative AI is very powerful, but whether it is an invaluable research tool or the cause of a major disaster with your work depends solely on how it’s used.
Researchers should take the time to thoughtfully consider: how the particular AI program works, what happens to the data shared with it, whether or not there are any risks in sharing the given set of data, the purpose of using AI, any policies surrounding its use for the given purpose, and the multiple concerns that have arisen about its use. The following table helps you consider each of these in more detail before making a decision.
| Don’t Use | Use | Guidance |
|---|---|---|
| Use is prohibited by the entity who will receive the file/information. | Use is allowed by the entity who will receive the file/information. | Always check with entities such as course instructors, scholarship offices/organizations, funding agencies, etc. to learn about their generative AI use polices. |
| Approval has not been obtained or discussed with research mentor in charge. | Approval has specifically been obtained from the research mentor in charge. | Talk to your mentor about 1) which data can be shared with AI programs and 2) which AI programs/analyses are permissible for each type/set of data. |
| It is not safe to share data with the particular AI program or the risks are unknown. | The University (and the research mentor in chage, if relevant) have declared it is safe to share the given data with the particular AI program. | All AI programs are not created equal! They vary greatly regarding what happens to the information shared and how safe it is. Always make sure the AI program you wish to use is approved for your data. |
| You know how to use the given AI program, but don’t know how it works. | You know how the given AI programs works. | Using AI without understanding how the program works is not AI literacy it is AI dependence. Reach out to the Data Science and AI Academy (DSA) and/or the University Libraries (AI for Researchers) for resources and workshops to fill in any knowledge gaps you may have. |
| You need generative AI to perform task you don’t know how to do on your own. | You need generative AI to perform tasks you know how to do on your own, but cannot perform as quickly. | You must be able to critically evaluate the output of your AI use. This requires you to have a true understanding of the task at hand. |
| You will not be confirming or critically evaluating the results/output you receive. | You will be confirming or critically evaluating the results/output you receive. | Generative AI programs can give inaccurate, wrong, and even hallucinated information. You must thoroughly analyze all results/output. |
| You are unaware or are aware and concerned about the impacts/potential impacts of generative AI use (environmental, cognitive, societal, etc.) to the point you are not comfortable using it. | You are aware of the impacts/potential impacts of generative AI use (environmental, cognitive, societal, etc.), and have factored those into your work in such a way as to be comfortable using it. | As responsible researchers it is necessary to be educated about the impact/potential impact your work may have on others (and yourself). Investigate trusted resources for the most up-to-date information on generative AI’s impacts and determine for yourself what, if any, limitations are needed. |