Contributing
Contributions to Biomappings are welcomed and encouraged. Thanks for considering to participate.
All contributors, maintainers, and participants of the Biomappings project are expected to follow our Code of Conduct. This document is organized as follows:
Content Contribution
There are several ways to add new content to Biomappings:
- Write a script that automatically generates new mappings
- Curate mappings using the local web application
- Suggest curations in the issue tracker
Who Can Add New Mappings
New mappings can be added by anyone, even if they are for a resource they do not themselves maintain. A main goal of Biomappings is to fill in the gaps left by primary curation projects, so expertise is welcome from anywhere.
Requirements for New Mappings
- Mappings must use canonical Bioregistry prefixes and local unique identifier standards
- Mappings should not duplicate previously curated work (e.g., from primary resources or other Biomappings curations)
- Mappings should be one-to-one between vocabularies when possible
- Mappings should be properly attributed with ORCID for manual curation or provenance to a script if automatically generated.
Editing Mappings
Sometimes, it becomes clear that a mapping was not correct. If this is the case,
than anyone is free to turn a manually curated mapping into an incorrect
mapping.
However, the original contributor should be contacted (which should be possible
via the git blame
feature on GitHub as well as the ORCID identifier
annotation).
Currently, Biomappings uses a simple format and does not track full change history for each mapping. Therefore, the ORCID identifier should be overwritten by the last person to make a manual curation to the mapping.
Review of Additions and Edits
Review of edits to existing records is handled by the Biomappings Review Team, whose membership and conduct is described in the Biomappings’s Project Governance.
Code Contribution
This project uses the GitHub Flow model for code contributions. Follow these steps:
- Create a fork of the upstream
repository
at
biopragmatics/biomappings
on your GitHub account (or in one of your organizations) - Clone your fork
with
git clone https://github.com/<your namespace here>/biomappings.git
- Make and commit changes to your fork with
git commit
- Push changes to your fork with
git push
- Repeat steps 3 and 4 as needed
- Submit a pull request back to the upstream repository
Merge Model
Biomappings uses squash merges to group all related commits in a given pull request into a single commit upon acceptance and merge into the main branch. This has several benefits:
- Keeps the commit history on the main branch focused on high-level narrative
- Enables people to make lots of small commits without worrying about muddying up the commit history
- Commits correspond 1-to-1 with pull requests
Code Style
This project encourages the use of optional static typing. It
uses mypy
as a type checker
and sphinx_autodoc_typehints
to automatically generate documentation based on type hints. You can check if
your code passes mypy
with tox -e mypy
.
This project uses black
to automatically
enforce a consistent code style. You can apply black
and other pre-configured
linters with tox -e lint
.
This project uses flake8
and several plugins for
additional checks of documentation style, security issues, good variable
nomenclature, and more (see tox.ini
for a list of flake8 plugins). You can check if your
code passes flake8
with tox -e flake8
.
Each of these checks are run on each commit using GitHub Actions as a continuous integration service. Passing all of them is required for accepting a contribution. If you’re unsure how to address the feedback from one of these tools, please say so either in the description of your pull request or in a comment, and we will help you.
Logging
Python’s builtin print()
should not be used (except when writing to files),
it’s checked by the
flake8-print
plugin to flake8
. If
you’re in a command line setting or main()
function for a module, you can use
click.echo()
. Otherwise, you can use the builtin logging
module by adding
logger = logging.getLogger(__name__)
below the imports at the top of your
file.
Documentation
All public functions (i.e., not starting with an underscore _
) must be
documented using the sphinx documentation format.
The darglint
plugin to flake8
reports on functions that are not fully documented.
This project uses sphinx
to automatically build
documentation into a narrative structure. You can check that the documentation
properly builds with tox -e docs-test
.
Testing
Functions in this repository should be unit tested. These can either be written
using the unittest
framework in the tests/
directory or as embedded
doctests. You can check that the unit tests pass with tox -e py
and that the
doctests pass with tox -e doctests
. These tests are required to pass for
accepting a contribution.
Syncing your fork
If other code is updated before your contribution gets merged, you might need to resolve conflicts against the main branch. After cloning, you should add the upstream repository with
$ git remote add biopragmatics https://github.com/biopragmatics/biomappings.git
Then, you can merge upstream code into your branch. You can also use the GitHub UI to do this by following this tutorial.
Python Version Compatibility
This project aims to support all versions of Python that have not passed their
end-of-life dates. After end-of-life, the version will be removed from the Trove
qualifiers in the setup.cfg
and from the GitHub Actions testing configuration.
See https://endoflife.date/python for a timeline of Python release and end-of-life dates.
Review of Pull Requests
Review of edits to existing records is handled by the Biomappings Core Development Team, whose membership and conduct is described in the Biomappings’s Project Governance.