All data that were used to carry out your research should be cited to ensure reproducibility of your work. This includes both data that you generated and any data that were previously published (by you or someone else) and re-used in a given study.
Data should be cited in the same fashion as a journal article; an abbreviated in-text citation should be included wherever appropriate in the main text, and a full reference should be included in the reference. This includes both data that you generated and any data that were previously published (by you or someone else).
Many of the major reference formats that are used by publishers have specific reference/citation formats for data/datasets, in addition to the better-known styles for journal articles, books, preprints, etc. (e.g., APA). A journal may also have its own specification for how to cite a dataset. In general, dataset references should contain:
Additional information like the dataset version (if relevant) and a bracketed description (e.g., "[Data set]") may also be recommended or required. The in-text citation should be formatted like articles or books (e.g., Smith et al., 2025).
Software should be cited in the same fashion as a journal article; an abbreviated in-text citation should be included wherever appropriate in the main text, and a full reference should be included in the reference. This includes both software that you produced and any software that were previously released (by you or someone else).
Software citation formats may vary by publisher so it is important to following the recommended citation style if one exists. If you find that there is no publisher specific software citation guidance for you to follow, check to see if the developers of the software you have utilized have provided a recommended citation format. If you cannot find specific guidance from the publisher you are working with or from the software developers themselves, a good option is to adhere to the Force11 software citation guidelines and include your software citations in your list of references alongside citations for articles, books, datasets, and other materials referenced by your work.
Oftentimes, when researchers are reusing data or software that were previously published, they cite the associated paper that first described the data or software. However, this is a poor scholarly practice for several reasons:
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 Generic License.