NCBI Logo
GEO Logo
   NCBI > GEO > Accession DisplayHelp Not logged in | LoginHelp
GEO help: Mouse over screen elements for information.
          Go
Sample GSM6958403 Query DataSets for GSM6958403
Status Public on Jul 22, 2023
Title Nephrotoxic Nephritis sample of BTLA-KO mouse (replicate 1)
Sample type SRA
 
Source name Knock out
Organism Mus musculus
Characteristics cell type: renal leukocytes
treatment: Knock-out
tissue: Kidney
disease state: Nephrotoxic Nephritis
Treatment protocol Induction of nephrotoxic nephritis 10 days prior to organ harvest
Growth protocol Not applicable
Extracted molecule total RNA
Extraction protocol kidneys where digested, CD45+ leukocytes isolated using MACS beads, then sorted using FACS gating on Living (DAPIneg),CD45+
10xGenomics 3`scRNAseq
 
Library strategy RNA-Seq
Library source transcriptomic single cell
Library selection cDNA
Instrument model Illumina NovaSeq 6000
 
Data processing Libraries were sequenced on an Illumina NovaSeq 6000 sequencing instrument with 29+89bp read length.
Fastqs were generated using the command mkfastq of the 10x Genomics software "cellranger" (v6.1.2). The filtered gene count matrices of the four samples were generated using cellranger's count command and the mm10 reference genome (version preprocessed by cellranger). For the two WT and the two BtlaKO samples, 3.5K, 5.4K, 6.4K, and 8.4K cells were recovered, respectively. Sequencing depths of the samples were about 64.9K, 60.3K, 52.2K, and 37.2K reads per cell in the same order. That resulted in a median number of genes per cell between 2K and 2.3K for each sample. All given numbers were rounded to the first decimal place.
Downstream analysis was performed using the Seurat R package (version 4.1.1) 29. Cells with a very small (<500 genes) or large (>5000 genes) transcriptome recovery were discarded to reduce noise and the number of possible doublets, respectively. Only cells with mitochondrial expression of less than 5% and ribosomal expression of less than 50% were kept to avoid working with low-quality cells in downstream steps. For normalization, we have used the standard Log-normalization approach with the standard scaling factor (10e4). Samples were then integrated using the standard Seurat integration procedure.
We followed the standard Seurat-workflow to perform principal component analysis, extracting the first 100 principal components. Based on the data generated by Seurat's Elbow plot function, we automatically determined the number of dimensions used for dimensionality reduction and clustering of all immune cells. In particular, we selected the dimension where the standard deviation starts to drop below 2; this automatic selection procedure was confirmed by visual inspection of the elbow plot. Subsequently, we have executed the UMAP algorithm and the FindNeighbors function with the aforementioned dimensions. Clustering was performed using the FindClusters function with a resolution of 0.1. Otherwise, we used standard parameters. The dimensionality reduction and clustering procedure for the T cells was performed similarly, but we used only the first 12 dimensions by visual inspection of the elbow plot, as the complexity and variability within T cell population drops as expected significantly compared to first level of resolution (all immune cells). In this step we used a clustering resolution of 0.6. Cluster 6 was identified as contamination with macrophages, probably due to the formation of doublets and was discarded. The other cells were reclustered using the same workflow, this time with a clustering resolution of 0.5. For all downstream steps we worked with the remaining 5412, 7361, 2939, and 4521 cells for the knockout and wild-type samples, respectively. At T cell level, we worked accordingly with 826, 1433, 366, and 609 cells in the same mentioned order.
Markers of the clusters at both clustering levels were computed with the Wilcoxon rank sum test implemented in Seurat using the FindAllMarkers function. The top 20 positive markers of each cluster were used to manually curate the cell types of the immune cells (the initial clustering level of all cells). The reference database used to determine the cell types is the single-cell RNASeq database PanglaoDB30. We determined the identity of the resulting cluster subpopulations by visualizing the expression patterns of most basic markers (e.g., Cd4+, Cd8+, and some alpha, beta, gamma, and delta chain genes) across clusters to understand the approximate structure of T cell subpopulations in our dataset. Subsequently, we have computed the module scores of many known standard subpopulations of T cells for each cell in each cluster using the AddModuleScore function. By box plotting the distribution of the scores of each module for each cluster, we identify the subpopulations of the T cells by visually inspecting and comparing the module score distributions. To assign a cell type to a subpopulation we require that the median score is at least higher than zero or that no other type is fitting better while the median being close to zero. In the case that two cell type modules are fitting equally well into the same subcluster, we annotate the cluster with both unless we could biologically exclude one of them. For instance, in the case of clusters where almost all cells are Cd4+, we exclude the possibility of the cluster being a subpopulation of cytotoxic T cells. Cycling T cells were recognized using cell cycle genes integrated within Seurat and corresponding G2M and S scores. The markers of the various T cell subpopulations were taken from the Biocompare website
Assembly: mm10 (preprocessed release downloaded from the website of 10x Genomics)
Supplementary files format and content: four csv files for the raw counts generated by cellranger for each sample and three additional csv files; the first for the integrated and filtered object from the four raw samples and the other two contain the cell type annotation of all immune cells and the fine-granuled annotation of the T-cells, respectively
 
Submission date Jan 23, 2023
Last update date Jul 22, 2023
Contact name Ali T Abdallah
Organization name University of Cologne
Department Medical Faculty
Lab CECAD
Street address Joseph Stelzmann Str. 26
City Cologne
ZIP/Postal code 50931
Country Germany
 
Platform ID GPL24247
Series (1)
GSE223534 BTLA NTN scRNAseq
Relations
BioSample SAMN32878342
SRA SRX19144161

Supplementary file Size Download File type/resource
GSM6958403_KO_R1.csv.gz 16.7 Mb (ftp)(http) CSV
SRA Run SelectorHelp
Raw data are available in SRA
Processed data provided as supplementary file

| NLM | NIH | GEO Help | Disclaimer | Accessibility |
NCBI Home NCBI Search NCBI SiteMap