mas.to is one of the many independent Mastodon servers you can use to participate in the fediverse.
Hello! mas.to is a fast, up-to-date and fun Mastodon server.

Administered by:

Server stats:

12K
active users

#singlecell

0 posts0 participants0 posts today

#Genomics #SingleCell

These FASTQ files are driving me batty.

I have FASTQ files for a Chromium prepared library. I run cutadapt to try and limit bad reads. I run RNASTAR solo with correct parameters to extract the read UMI and GEM barcode into UM CB tags of BAM file.

I run umitools to dedup based on barcode.

AssertionError: not all umis are the same length(!): 1 - 10

Why. Is umitools. Dying??? All the barcodes and UMIs should be the same length. They're fixed length pulls from the read.

Any #rstats #Genomics #SingleCell people used scSplit for genetic demultiplexing of non-hashed mixed samples?

On the GitHub page it says:

"If necessary, remove duplicated reads based on UMI using tools like rmdup in UMI-tools."

github.com/jon-xu/scSplit

That makes sense. But shouldn't it really be by GEM *and* UMI? Just UMI would mix GEMs, and that would exclude identically mapping reads from different droplets. Unless umitools checks *both*. I've only used it on fastq files before, not BAM.

GitHubGitHub - jon-xu/scSplit: Genotype-free demultiplexing of pooled single-cell RNA-Seq, using a hidden state model for identifying genetically distinct samples within a mixed population.Genotype-free demultiplexing of pooled single-cell RNA-Seq, using a hidden state model for identifying genetically distinct samples within a mixed population. - jon-xu/scSplit