Welcome to Tabula Sapiens!
For a complete walkthrough the resource checkout our portal: http://tabula-sapiens-portal.ds.czbiohub.org/
To search extended donor metadata, go to ChatTS, and see the instructions for help.
Since April 2021, Tabula Sapiens data have been made available to all users free of charge. This product is part of the AWS Open Data Sponsorship Program and contains data sets that are publicly available for anyone to access and use. AWS has made the data freely available so that anyone can download the resource to perform analysis and advance medical discovery without needing to worry about the cost of storing Tabula Sapiens data or the time required to download it. The data can be browsed but before you can download we require users to complete a data sharing agreement.
The s3 public bucket follows this example structure:
aws s3 ls czb-tabula-sapiens
├──TabulaSapiens_v1_Science2022
│ ├── Donor1/
│ ├── Donor10/
│ ├── Donor11/
│ ├── Donor12/
│ ├── Donor13/
│ ├── Donor14/
│ ├── Donor15/
│ ├── Donor2/
│ │ ├── alignment-gencode/
│ │ │ ├── 10X/
│ │ │ ├── smartseq2/
│ │ ├── fastqs/
│ │ │ ├── 10X/
│ │ │ ├── smartseq2/
│ │ ├── gene-count-tables/
│ │ ├── immune-repertoire-analysis/
│ │ │ ├── bracer/
│ │ │ ├── tracer/
│ ├── Donor3/
│ ├── Donor4/
│ ├── Donor5/
│ ├── Donor6/
│ ├── Donor7/
│ ├── Donor8/
│ ├── Donor9/
│ ├── reference/
│ │ ├── cellranger/
│ │ │ ├── homo.gencode.v30.annotation.ERCC92.tgz
│ │ ├── STAR/
│ │ │ ├── homo.gencode.v30.annotation.ERCC92.tgz
│ ├── ...
├──TabulaSapiens_v2
│ ├── reference/
│ │ ├── gencode_v41_cellranger.tar.gz
│ │ ├── gencode_v41_ercc_star.tgz
│ │ ├── ...
│ ├── Donor1/
│ ├── Donor2/
│ │ ├── alignment-gencode/
│ │ │ ├── smartseq2/
│ │ │ ├── TSP2_...
│ │ ├── fastqs/
│ │ │ ├── 10X/
│ │ │ ├── smartseq2/
│ ...
Each DonorN folder contains all the raw data for TSPN. Each fastqs folder is split between 10x and smartseq with respective files inside, an identical strucutre followed by the alignment-gencode folder. The gene-count-tables correspond the smartseq2 only. For the donors for which we generated smartseq2 we also provide the raw outputs for bracer and tracer inside the respective immune-repertoire-analysis folder. The reference files to use to reprocess the entire dataset from fastqs onwards are provide in reference.
For instructions on how to create an AWS account (free of charge) please refer to AWS documentation
Our ready-to-use data is available from figshare: https://figshare.com/projects/Tabula_Sapiens/100973 All files can be interactively explored in the cellxgene portal If you would like to use cellxgene to explore the data on your local machine consider using exploratory-cellxgene
Checkout detailed instructions here
