Read

`quicat.read_dna(csv_path)` ¶

This function reads the bulkDNA CSV file from the extract pipeline into a Polars DataFrame, extracts the specified columns as metadata, and creates a sparse matrix from the remaining columns. The resulting AnnData object is prepared with the metadata and count data.

Parameters:

Name	Type	Description	Default
`csv_path`	`str`	Path to the CSV file.	required
`columns`	`list of str`	List of column names to extract as cell metadata.	required

Returns:

Type	Description
`AnnData`	anndata.AnnData: The constructed AnnData object with the sample metadata and count matrix.

Source code in src/quicat/read/read_extract_output.py

def read_dna(csv_path: str) -> sc.AnnData:
    """
    This function reads the bulkDNA CSV file from the extract pipeline into a Polars DataFrame, extracts the specified
    columns as metadata, and creates a sparse matrix from the remaining columns.
    The resulting AnnData object is prepared with the metadata and count data.

    Parameters:
        csv_path (str): Path to the CSV file.
        columns (list of str): List of column names to extract as cell metadata.

    Returns:
        anndata.AnnData: The constructed AnnData object with the sample metadata and count matrix.
    """

    df = pl.read_csv(csv_path)
    metadata_columns_present = [col for col in (METADATA_DNA) if col in df.columns]

    cell_metadata = df.select(metadata_columns_present)
    barcodes = [col for col in df.columns if col not in metadata_columns_present]
    matrix = sp.csr_matrix(df.select(barcodes).to_numpy()).astype(float)

    adata = sc.AnnData(X=matrix, var={"var_names": barcodes}, obs=cell_metadata.to_pandas())
    adata.obs_names = adata.obs["sample"].values

    for col in adata.obs.columns:
        adata.obs[col] = adata.obs[col].astype("category")
    sc.pp.calculate_qc_metrics(adata, inplace=True, percent_top=None, var_type="barcodes")
    return adata

`quicat.read_sc(csv_path)` ¶

This function reads the single-cell or spatial transcriptomics CSV file from the extract pipeline into a Polars DataFrame, extracts the specified columns as metadata, and creates a sparse matrix from the remaining columns. The resulting AnnData object is prepared with the metadata and expression data.