cmipcite.citations#

Citation support

Classes:

Name	Description
`AuthorListStyle`	Author list style
`DOIGranularity`	DOI granularity
`FormatOption`	Citation format options

Functions:

Name	Description
`get`	Get citations without duplicates from CMIP files or tracking IDs or PIDs
`get_bibtex_citation`	Get bibtex citation
`get_citations`	Get citations that apply to the given IDs or paths
`get_doi_and_version`	Get DOI and version for a given ID or path to a netCDF file
`get_text_citation`	Get text citation
`get_tracking_id_from_cmip_netcdf`	Get tracking ID from a CMIP netCDF file
`translate_get_args_to_get_citations_kwargs`	Translate the arguments of (m).get to arguments needed by (m).get_citations

AuthorListStyle #

Bases: StrEnum

Author list style

Attributes:

Name	Type	Description
`LONG`		Long i.e. list all names
`SHORT`		Short i.e. use "et al."

Source code in src/cmipcite/citations.py

class AuthorListStyle(StrEnum):
    """
    Author list style
    """

    SHORT = "short"
    """
    Short i.e. use "et al."
    """

    LONG = "long"
    """
    Long i.e. list all names
    """

LONG `class-attribute` `instance-attribute` #

LONG = 'long'

Long i.e. list all names

SHORT `class-attribute` `instance-attribute` #

SHORT = 'short'

Short i.e. use "et al."

DOIGranularity #

Bases: StrEnum

DOI granularity

CMIP data can be aggregated at different granularities. Data citations are designated on data aggregations belonging to a model contribution to a MIP (or activity_id) and on data belonging to an experiment contributed by a specific model: model: /// experiment: ////.

Attributes:

Name	Type	Description
`EXPERIMENT`		mip-model-experiment granularity of DOI.
`MODEL`		mip-model granularity of DOI.

Source code in src/cmipcite/citations.py

class DOIGranularity(StrEnum):
    """
    DOI granularity

    CMIP data can be aggregated at different granularities.
    Data citations are designated on data aggregations belonging to a model
    contribution to a MIP (or activity_id) and on data belonging to an experiment
    contributed by a specific model:
    model: <mip_era>/<activity_id>/<institution_id>/<source_id>
    experiment: <mip_era>/<activity_id>/<institution_id>/<source_id>/<experiment_id>.
    """

    EXPERIMENT = "experiment"
    """
    mip-model-experiment granularity of DOI.
    """

    MODEL = "model"
    """
    mip-model granularity of DOI.
    """

EXPERIMENT `class-attribute` `instance-attribute` #

EXPERIMENT = 'experiment'

mip-model-experiment granularity of DOI.

MODEL `class-attribute` `instance-attribute` #

MODEL = 'model'

mip-model granularity of DOI.

FormatOption #

Bases: StrEnum

Citation format options

Attributes:

Name	Type	Description
`BIBTEX`		Bibtex format
`TEXT`		Plain text file

Source code in src/cmipcite/citations.py

class FormatOption(StrEnum):
    """
    Citation format options
    """

    BIBTEX = "bibtex"
    """
    Bibtex format
    """

    TEXT = "text"
    """
    Plain text file
    """

BIBTEX `class-attribute` `instance-attribute` #

BIBTEX = 'bibtex'

Bibtex format

TEXT `class-attribute` `instance-attribute` #

TEXT = 'text'

Plain text file

get #

get(
    in_values: list[str],
    format: FormatOption = TEXT,
    author_list_style: AuthorListStyle = LONG,
    doi_granularity: DOIGranularity = MODEL,
    multi_dataset_handling: MultiDatasetHandlingStrategy
    | None = None,
    handle_server_url: str = "http://hdl.handle.net/",
) -> list[str]

Get citations without duplicates from CMIP files or tracking IDs or PIDs

This function mirrors the CLI get command as closely as possible.

Parameters:

Name	Type	Description	Default
`in_values`	`list[str]`	Tracking IDs, PIDs or file paths for which to generate citations. Paths should point to a CMIP file with a `tracking_id` global attribute.	required
`format`	`FormatOption`	Format in which to retrieve the citations	`TEXT`
`author_list_style`	`AuthorListStyle`	Whether, if the format is text, the author list should be long (all names) or short (et al.)	`LONG`
`doi_granularity`	`DOIGranularity`	Granularity of DOI to retrieve. See DOIGranularity for details.	`MODEL`
`multi_dataset_handling`	`MultiDatasetHandlingStrategy \| None`	Strategy to use when a given ID or file belongs to multiple datasets	`None`
`handle_server_url`	`str`	URL of the server to use for handling tracking IDs i.e. handles If not supplied, a new client with a default handle server URL is instantiated.	`'http://hdl.handle.net/'`

Returns:

Type	Description
`list[str]`	Retrieved citations for `in_values`

Notes

Citations can be retrieved with the help of the Persistent IDentifiers (PIDs). In the CMIP world, there are two types of PIDs:

file PID (normally referred to as a tracking ID)
dataset PID (normally simply referred to as PID).

A dataset is a collection of files (for CMIP, this collection of files is for a single variable sampled at a single frequency and spatial sampling from a single model running a single experiment). Both PID types can be passed to in_values.

For a given PID, we can retrieve an associated DOI. However, there are multiple possibilities for the retrieved DOI. These vary based on the granularity of the DOI. At the moment, as far as we know, there are two granularities: * model (capturing all submissions to a given MIP by a given model) * experiment (capturing all submissions to a given MIP by a given model for a given experiment. This is controlled by doi_granularity.

Source code in src/cmipcite/citations.py

def get(  # noqa: PLR0913
    in_values: list[str],
    format: FormatOption = FormatOption.TEXT,
    author_list_style: AuthorListStyle = AuthorListStyle.LONG,
    doi_granularity: DOIGranularity = DOIGranularity.MODEL,
    multi_dataset_handling: MultiDatasetHandlingStrategy | None = None,
    handle_server_url: str = "http://hdl.handle.net/",
) -> list[str]:
    """
    Get citations without duplicates from CMIP files or tracking IDs or PIDs

    This function mirrors the CLI `get` command as closely as possible.

    Parameters
    ----------
    in_values
        Tracking IDs, PIDs or file paths for which to generate citations.
        Paths should point to a CMIP file with a `tracking_id` global attribute.

    format
        Format in which to retrieve the citations

    author_list_style
        Whether, if the format is text,
        the author list should be long (all names) or short (et al.)

    doi_granularity
        Granularity of DOI to retrieve.

        See [DOIGranularity][(m).] for details.

    multi_dataset_handling
        Strategy to use when a given ID or file belongs to multiple datasets

    handle_server_url
        URL of the server to use for handling tracking IDs i.e. handles
        If not supplied, a new client with a default handle server URL
        is instantiated.

    Returns
    -------
    :
        Retrieved citations for `in_values`

    Notes
    -----
    Citations can be retrieved with the help of the Persistent IDentifiers (PIDs).
    In the CMIP world, there are two types of PIDs:

       * file PID (normally referred to as a tracking ID)
       * dataset PID (normally simply referred to as PID).

    A dataset is a collection of files
    (for CMIP, this collection of files
    is for a single variable sampled at a single frequency and spatial sampling
    from a single model running a single experiment).
    Both PID types can be passed to `in_values`.

    For a given PID, we can retrieve an associated DOI.
    However, there are multiple possibilities for the retrieved DOI.
    These vary based on the granularity of the DOI.
    At the moment, as far as we know, there are two granularities:
        * model (capturing all submissions to a given MIP by a given model)
        * experiment (capturing all submissions to a given MIP by a given model for a
        given experiment.
    This is controlled by `doi_granularity`.
    """
    get_citations_kwargs = translate_get_args_to_get_citations_kwargs(
        format=format,
        author_list_style=author_list_style,
        handle_server_url=handle_server_url,
    )

    try:
        citations = get_citations(
            ids_or_paths=in_values,
            multi_dataset_handling=multi_dataset_handling,
            doi_granularity=doi_granularity,
            **get_citations_kwargs,
        )
    except MultipleDatasetMemberError as exc:
        msg = (
            "One of your input values is a member of more than one dataset. "
            "You can resolve this by passing a value for the "
            "`multi_dataset_handling` argument. "
            "In most cases, adding "
            "`from cmipcite.tracking_id import MultiDatasetHandlingStrategy` "
            "and then using "
            "`multi_dataset_handling=MultiDatasetHandlingStrategy.LATEST` "
            "is what you will want "
            "(this will give you the reference to the last published dataset "
            "that includes your ID)."
        )
        raise ValueError(msg) from exc

    return citations

get_bibtex_citation #

get_bibtex_citation(doi: str, version: str) -> str

Get bibtex citation

The version is added to the title field.

Parameters:

Name	Type	Description	Default
`doi`	`str`	DOI for which to get the citation	required
`version`	`str`	Version of the dataset associated with `doi`	required

Returns:

Type	Description
`str`	Bibtex citation

Source code in src/cmipcite/citations.py

def get_bibtex_citation(doi: str, version: str) -> str:
    """
    Get bibtex citation

    The version is added to the title field.

    Parameters
    ----------
    doi
        DOI for which to get the citation

    version
        Version of the dataset associated with `doi`

    Returns
    -------
    :
        Bibtex citation
    """
    url = "http://dx.doi.org/" + doi
    headers = {"accept": "application/x-bibtex"}
    r = httpx.get(url, headers=headers, follow_redirects=True)

    bib = r.raise_for_status().text

    # add version to title
    citation = re.sub(
        r"title = {(.*?)}",
        lambda m: f"title = {{{m.group(1)}. Version {version}.}}",
        bib,
    )

    return citation

get_citations #

get_citations(
    ids_or_paths: list[str],
    get_citation: Callable[[str, str], str],
    doi_granularity: DOIGranularity,
    client: RESTHandleClient | None = None,
    multi_dataset_handling: MultiDatasetHandlingStrategy
    | None = None,
) -> list[str]

Get citations that apply to the given IDs or paths

If your IDs are tracking IDs or paths, then two or more IDs/paths can share the same citation. This function returns the minimum set of citations required i.e. any duplicate citations are removed.

Parameters:

Name	Type	Description	Default
`ids_or_paths`	`list[str]`	Tracking ids (file PID), dataset PIDs and paths for which to get citations. Tracking ids identify files. To date, they can be found in the `tracking_id` global attribute of CMIP netCDF files. PIDs identify datasets (a group of files). Paths should point to a CMIP file with a `tracking_id` global attribute.	required
`get_citation`	`Callable[[str, str], str]`	Function which, given a DOI and a version, produces a citation For example, get_bibtex_citation.	required
`doi_granularity`	`DOIGranularity`	Granularity of DOI to retrieve. See DOIGranularity for details.	required
`client`	`RESTHandleClient \| None`	Client to use for interacting with pyhandle's REST API If not supplied, a new client with a default handle server URL is instantiated.	`None`
`multi_dataset_handling`	`MultiDatasetHandlingStrategy \| None`	What to do in the case that the tracking ID belongs to multiple datasets i.e. is associated with more than one PID. Passed to get_dataset_pid.	`None`

Returns:

Type	Description
`list[str]`	Citations for the given `ids_or_paths`

Notes

Citations can be retrieved with the help of the Persistent IDentifiers (PIDs). In the CMIP world, there are two types of PIDs:

file PID (normally referred to as a tracking ID)
dataset PID (normally simply referred to as PID).

A dataset is a collection of files (for CMIP, this collection of files is for a single variable sampled at a single frequency and spatial sampling from a single model running a single experiment). Both PID types can be passed to ids_or_paths.

For a given PID, we can retrieve an associated DOI. However, there are multiple possibilities for the retrieved DOI. These vary based on the granularity of the DOI. At the moment, as far as we know, there are two granularities: * model (capturing all submissions to a given MIP by a given model) * experiment (capturing all submissions to a given MIP by a given model for a given experiment. This is controlled by doi_granularity.

Examples:

>>> citations = get_citations(
...     ["hdl:21.14100/f2f502c9-9626-31c6-b016-3f7c0534803b"],
...     doi_granularity=DOIGranularity.MODEL,
...     get_citation=get_bibtex_citation,
... )
>>> print(citations[0])
@misc{https://doi.org/10.22033/esgf/cmip6.742,
  doi = {10.22033/ESGF/CMIP6.742},
  url = {http://cera-www.dkrz.de/WDCC/meta/CMIP6/CMIP6.CMIP.MPI-M.MPI-ESM1-2-LR},
  author = {Wieners, Karl-Hermann and Giorgetta, Marco and Jungclaus, Johann and Reick, Christian and Esch, Monika and Bittner, Matthias and Legutke, Stephanie and Schupfner, Martin and Wachsmann, Fabian and Gayler, Veronika and Haak, Helmuth and de Vrese, Philipp and Raddatz, Thomas and Mauritsen, Thorsten and von Storch, Jin-Song and Behrens, Jörg and Brovkin, Victor and Claussen, Martin and Crueger, Traute and Fast, Irina and Fiedler, Stephanie and Hagemann, Stefan and Hohenegger, Cathy and Jahns, Thomas and Kloster, Silvia and Kinne, Stefan and Lasslop, Gitta and Kornblueh, Luis and Marotzke, Jochem and Matei, Daniela and Meraner, Katharina and Mikolajewicz, Uwe and Modali, Kameswarrao and Müller, Wolfgang and Nabel, Julia and Notz, Dirk and Peters-von Gehlen, Karsten and Pincus, Robert and Pohlmann, Holger and Pongratz, Julia and Rast, Sebastian and Schmidt, Hauke and Schnur, Reiner and Schulzweida, Uwe and Six, Katharina and Stevens, Bjorn and Voigt, Aiko and Roeckner, Erich},
  keywords = {CMIP6, climate, CMIP6.CMIP.MPI-M.MPI-ESM1-2-LR},
  language = {en},
  title = {MPI-M MPIESM1.2-LR model output prepared for CMIP6 CMIP. Version 20211412.},
  publisher = {Earth System Grid Federation},
  year = {2019},
  copyright = {Creative Commons Attribution 4.0 International}
}

Source code in src/cmipcite/citations.py

def get_citations(  # type: ignore
    ids_or_paths: list[str],
    get_citation: Callable[[str, str], str],
    doi_granularity: DOIGranularity,
    client: RESTHandleClient | None = None,
    multi_dataset_handling: MultiDatasetHandlingStrategy | None = None,
) -> list[str]:
    """
    Get citations that apply to the given IDs or paths

    If your IDs are tracking IDs or paths,
    then two or more IDs/paths can share the same citation.
    This function returns the minimum set of citations required
    i.e. any duplicate citations are removed.

    Parameters
    ----------
    ids_or_paths
        Tracking ids (file PID), dataset PIDs and paths for which to get citations.

        Tracking ids identify files.
        To date, they can be found
        in the `tracking_id` global attribute of CMIP netCDF files.

        PIDs identify datasets (a group of files).

        Paths should point to a CMIP file with a `tracking_id` global attribute.

    get_citation
        Function which, given a DOI and a version, produces a citation

        For example, [get_bibtex_citation][(m).].

    doi_granularity
        Granularity of DOI to retrieve.

        See [DOIGranularity][(m).] for details.

    client
        Client to use for interacting with pyhandle's REST API

        If not supplied, a new client with a default handle server URL
        is instantiated.

    multi_dataset_handling
        What to do in the case that the tracking ID belongs to multiple datasets
        i.e. is associated with more than one PID.

        Passed to [get_dataset_pid][(p).tracking_id.get_dataset_pid].

    Returns
    -------
    :
        Citations for the given `ids_or_paths`

    Notes
    -----
    Citations can be retrieved with the help of the Persistent IDentifiers (PIDs).
    In the CMIP world, there are two types of PIDs:

       * file PID (normally referred to as a tracking ID)
       * dataset PID (normally simply referred to as PID).

    A dataset is a collection of files
    (for CMIP, this collection of files
    is for a single variable sampled at a single frequency and spatial sampling
    from a single model running a single experiment).
    Both PID types can be passed to `ids_or_paths`.

    For a given PID, we can retrieve an associated DOI.
    However, there are multiple possibilities for the retrieved DOI.
    These vary based on the granularity of the DOI.
    At the moment, as far as we know, there are two granularities:
        * model (capturing all submissions to a given MIP by a given model)
        * experiment (capturing all submissions to a given MIP by a given model for a
        given experiment.
    This is controlled by `doi_granularity`.

    Examples
    --------
    >>> citations = get_citations(
    ...     ["hdl:21.14100/f2f502c9-9626-31c6-b016-3f7c0534803b"],
    ...     doi_granularity=DOIGranularity.MODEL,
    ...     get_citation=get_bibtex_citation,
    ... )
    >>> print(citations[0])
    @misc{https://doi.org/10.22033/esgf/cmip6.742,
      doi = {10.22033/ESGF/CMIP6.742},
      url = {http://cera-www.dkrz.de/WDCC/meta/CMIP6/CMIP6.CMIP.MPI-M.MPI-ESM1-2-LR},
      author = {Wieners, Karl-Hermann and Giorgetta, Marco and Jungclaus, Johann and Reick, Christian and Esch, Monika and Bittner, Matthias and Legutke, Stephanie and Schupfner, Martin and Wachsmann, Fabian and Gayler, Veronika and Haak, Helmuth and de Vrese, Philipp and Raddatz, Thomas and Mauritsen, Thorsten and von Storch, Jin-Song and Behrens, Jörg and Brovkin, Victor and Claussen, Martin and Crueger, Traute and Fast, Irina and Fiedler, Stephanie and Hagemann, Stefan and Hohenegger, Cathy and Jahns, Thomas and Kloster, Silvia and Kinne, Stefan and Lasslop, Gitta and Kornblueh, Luis and Marotzke, Jochem and Matei, Daniela and Meraner, Katharina and Mikolajewicz, Uwe and Modali, Kameswarrao and Müller, Wolfgang and Nabel, Julia and Notz, Dirk and Peters-von Gehlen, Karsten and Pincus, Robert and Pohlmann, Holger and Pongratz, Julia and Rast, Sebastian and Schmidt, Hauke and Schnur, Reiner and Schulzweida, Uwe and Six, Katharina and Stevens, Bjorn and Voigt, Aiko and Roeckner, Erich},
      keywords = {CMIP6, climate, CMIP6.CMIP.MPI-M.MPI-ESM1-2-LR},
      language = {en},
      title = {MPI-M MPIESM1.2-LR model output prepared for CMIP6 CMIP. Version 20211412.},
      publisher = {Earth System Grid Federation},
      year = {2019},
      copyright = {Creative Commons Attribution 4.0 International}
    }
    """  # noqa: E501
    if client is None:  # pragma: no cover
        client = RESTHandleClient(handle_server_url="http://hdl.handle.net/")

    doi_versions = [
        get_doi_and_version(
            v,
            client=client,
            multi_dataset_handling=multi_dataset_handling,
            doi_granularity=doi_granularity,
        )
        for v in ids_or_paths
    ]

    doi_versions_unique = set(doi_versions)

    res = [get_citation(doi, version) for doi, version in doi_versions_unique]

    return res

get_doi_and_version #

get_doi_and_version(
    in_value: str,
    doi_granularity: DOIGranularity,
    client: RESTHandleClient | None = None,
    get_tracking_id_from_path: Callable[
        [Path], str
    ] = get_tracking_id_from_cmip_netcdf,
    multi_dataset_handling: MultiDatasetHandlingStrategy
    | None = None,
) -> tuple[str, str]

Get DOI and version for a given ID or path to a netCDF file

Parameters:

Name	Type	Description	Default
`in_value`	`str`	Input ID or path to a netCDF file	required
`doi_granularity`	`DOIGranularity`	Granularity of DOI to retrieve. We use the 'lowest-level' from the DRS as a short-hand. "experiment" is short for mip-model-experiment. "model" is short for mip-model. See DOIGranularity for details.	required
`client`	`RESTHandleClient \| None`	Client to use for interacting with pyhandle's REST API If not supplied, a new client with a default handle server URL is instantiated.	`None`
`get_tracking_id_from_path`	`Callable[[Path], str]`	Function which, given a path outputs the tracking ID	`get_tracking_id_from_cmip_netcdf`
`multi_dataset_handling`	`MultiDatasetHandlingStrategy \| None`	What to do in the case that the tracking ID belongs to multiple datasets i.e. is associated with more than one PID. Passed to get_dataset_pid.	`None`

Returns:

Name	Type	Description
`doi`	`str`	DOI that applies to `in_value`
`version`	`str`	Version that applies to `in_value`

Source code in src/cmipcite/citations.py

def get_doi_and_version(  # type: ignore
    in_value: str,
    doi_granularity: DOIGranularity,
    client: RESTHandleClient | None = None,
    get_tracking_id_from_path: Callable[[Path], str] = get_tracking_id_from_cmip_netcdf,
    multi_dataset_handling: MultiDatasetHandlingStrategy | None = None,
) -> tuple[str, str]:
    """
    Get DOI and version for a given ID or path to a netCDF file

    Parameters
    ----------
    in_value
        Input ID or path to a netCDF file

    doi_granularity
        Granularity of DOI to retrieve.

        We use the 'lowest-level' from the DRS as a short-hand.
        "experiment" is short for mip-model-experiment.
        "model" is short for mip-model.

        See [DOIGranularity][(m).] for details.

    client
        Client to use for interacting with pyhandle's REST API

        If not supplied, a new client with a default handle server URL
        is instantiated.

    get_tracking_id_from_path
        Function which, given a path outputs the tracking ID

    multi_dataset_handling
        What to do in the case that the tracking ID belongs to multiple datasets
        i.e. is associated with more than one PID.

        Passed to [get_dataset_pid][(p).tracking_id.get_dataset_pid].

    Returns
    -------
    doi :
        DOI that applies to `in_value`

    version :
        Version that applies to `in_value`
    """
    if client is None:  # pragma: no cover
        client = RESTHandleClient(handle_server_url="http://hdl.handle.net/")

    if Path(in_value).exists():
        tracking_id = get_tracking_id_from_path(Path(in_value))
        id_in_value = tracking_id.replace("hdl:", "")
        id_is_tracking_id = True

    else:
        id_in_value = in_value.replace("hdl:", "")

        agg_lev = client.get_value_from_handle(id_in_value, "AGGREGATION_LEVEL")
        if agg_lev == "DATASET":
            id_is_tracking_id = False

        elif agg_lev == "FILE":
            id_is_tracking_id = True

        else:  # pragma: no cover
            msg = f"The id {id_in_value} has an unknown AGGREGATION_LEVEL: {agg_lev}"
            raise NotImplementedError(msg)

    if id_is_tracking_id:
        pid = get_dataset_pid(
            tracking_id=id_in_value,
            multi_dataset_handling=multi_dataset_handling,
            client=client,
        )

    else:
        pid = id_in_value

    doi_raw = client.get_value_from_handle(pid, "IS_PART_OF")
    doi = doi_raw.replace("doi:", "")

    if doi_granularity == DOIGranularity.MODEL:
        # get model doi
        r = httpx.get(
            f"https://api.datacite.org/dois/{doi}",
            follow_redirects=True,
        )
        doi = r.raise_for_status().json()["data"]["attributes"]["container"][
            "identifier"
        ]

    elif doi_granularity == DOIGranularity.EXPERIMENT:
        # doi is already in the desired form
        pass

    else:  # pragma: no cover
        raise NotImplementedError(doi_granularity)

    version = client.get_value_from_handle(pid, "VERSION_NUMBER")

    return (doi, version)

get_text_citation #

get_text_citation(
    doi: str,
    version: str,
    author_list_style: AuthorListStyle,
) -> str

Get text citation

Parameters:

Name	Type	Description	Default
`doi`	`str`	DOI for which to get the citation	required
`version`	`str`	Version of the dataset associated with `doi`	required
`author_list_style`	`AuthorListStyle`	Style to use for the author list	required

Returns:

Type	Description
`str`	Plain text citation

Source code in src/cmipcite/citations.py

def get_text_citation(
    doi: str, version: str, author_list_style: AuthorListStyle
) -> str:
    """
    Get text citation

    Parameters
    ----------
    doi
        DOI for which to get the citation

    version
        Version of the dataset associated with `doi`

    author_list_style
        Style to use for the author list

    Returns
    -------
    :
        Plain text citation
    """
    r = httpx.get(f"https://api.datacite.org/dois/{doi}", follow_redirects=True)
    data = r.raise_for_status().json()["data"]["attributes"]

    if author_list_style == AuthorListStyle.SHORT:
        if len(data["creators"]) == 1:
            creators = data["creators"][0]["name"]

        else:
            creators = f"{data['creators'][0]['familyName']} et al."

    elif author_list_style == AuthorListStyle.LONG:
        creators = "; ".join([c["name"] for c in data["creators"]])

    else:  # pragma: no cover
        raise NotImplementedError(author_list_style)

    citation = (
        f"{creators} ({data['publicationYear']}): {data['titles'][0]['title']}. "
        f"Version {version}. {data['publisher']}. https://doi.org/{doi}."
    )

    return citation

get_tracking_id_from_cmip_netcdf #

get_tracking_id_from_cmip_netcdf(nc_path: Path) -> str

Get tracking ID from a CMIP netCDF file

Parameters:

Name	Type	Description	Default
`nc_path`	`Path`	Path to the CMIP netCDF file. The file must have a `tracking_id` global attribute.	required

Returns:

Type	Description
`str`	Tracking ID

Source code in src/cmipcite/citations.py

def get_tracking_id_from_cmip_netcdf(nc_path: Path) -> str:
    """
    Get tracking ID from a CMIP netCDF file

    Parameters
    ----------
    nc_path
        Path to the CMIP netCDF file.

        The file must have a `tracking_id` global attribute.

    Returns
    -------
    :
        Tracking ID
    """
    try:
        import netCDF4
    except ImportError as exc:
        raise MissingOptionalDependencyError(
            "get_tracking_id_from_cmip_netcdf", requirement="netCDF4"
        ) from exc

    with netCDF4.Dataset(nc_path) as ds:
        tracking_id = ds.getncattr("tracking_id")

    return str(tracking_id)

translate_get_args_to_get_citations_kwargs #

translate_get_args_to_get_citations_kwargs(
    format: FormatOption,
    author_list_style: AuthorListStyle,
    handle_server_url: str = "http://hdl.handle.net/",
) -> dict[str, Any]

Translate the arguments of (m).get to arguments needed by (m).get_citations

(m).get_citations is a lower-level function that supports dependency injection. (m).get is meant to mirror the equivalent command-line interface command, therefore has to work with more primitive types and does not support (direct) dependency injection.

Parameters:

Name	Type	Description	Default
`format`	`FormatOption`	Format in which to retrieve the citations	required
`author_list_style`	`AuthorListStyle`	Whether, if the format is text, the author list should be long (all names) or short (et al.)	required
`handle_server_url`	`str`	URL of the server to use for handling tracking IDs i.e. handles	`'http://hdl.handle.net/'`

Returns:

Type	Description
`dict[str, Any]`	Keyword arguments which can be passed to (m).get_citations

Source code in src/cmipcite/citations.py

def translate_get_args_to_get_citations_kwargs(
    format: FormatOption,
    author_list_style: AuthorListStyle,
    handle_server_url: str = "http://hdl.handle.net/",
) -> dict[str, Any]:
    """
    Translate the arguments of [(m).get][] to arguments needed by [(m).get_citations][]

    [(m).get_citations][] is a lower-level function that supports dependency injection.
    [(m).get][] is meant to mirror the equivalent command-line interface command,
    therefore has to work with more primitive types and does not support
    (direct) dependency injection.

    Parameters
    ----------
    format
        Format in which to retrieve the citations

    author_list_style
        Whether, if the format is text,
        the author list should be long (all names) or short (et al.)

    handle_server_url
        URL of the server to use for handling tracking IDs i.e. handles

    Returns
    -------
    :
        Keyword arguments which can be passed to [(m).get_citations][]
    """
    if format == FormatOption.TEXT:
        get_citation: Callable[[str, str], str] = partial(
            get_text_citation, author_list_style=author_list_style
        )

    elif format == FormatOption.BIBTEX:
        get_citation = get_bibtex_citation

    else:  # pragma: no cover
        raise NotImplementedError(FormatOption)

    client = RESTHandleClient(handle_server_url=handle_server_url)

    return dict(
        get_citation=get_citation,
        client=client,
    )

cmipcite.citations#

AuthorListStyle #

LONG class-attribute instance-attribute #

SHORT class-attribute instance-attribute #

DOIGranularity #

EXPERIMENT class-attribute instance-attribute #

MODEL class-attribute instance-attribute #

FormatOption #

BIBTEX class-attribute instance-attribute #

TEXT class-attribute instance-attribute #

get #

get_bibtex_citation #

get_citations #

get_doi_and_version #

get_text_citation #

get_tracking_id_from_cmip_netcdf #

translate_get_args_to_get_citations_kwargs #

LONG `class-attribute` `instance-attribute` #

SHORT `class-attribute` `instance-attribute` #

EXPERIMENT `class-attribute` `instance-attribute` #

MODEL `class-attribute` `instance-attribute` #

BIBTEX `class-attribute` `instance-attribute` #

TEXT `class-attribute` `instance-attribute` #