By looking at summ_for_scaffolds.txt, we see that some collectContigs assemblies don't have any reads map to them, in expectation, according to RSEM. This is confusing. Let's check whether blat aligns any reads to them. Specifically, we take a collectContigs assembly, ws30, which has 12 transcripts with no reads mapping to them in expectation, according to RSEM (see ./sum_for_scaffolds.txt). For every transcript named in trinity-gridsearch/rsem_oracleset.ws30.fa we look in trinity-gridsearch/rsem_oracleset-summary.ws30.rsem-eval-workingdir/expression.temp/expression.sam.gz to see if there is at least one pair of reads that maps to it: sam=trinity-gridsearch/rsem_oracleset-summary.ws30.rsem-eval-workingdir/expression.temp/expression.sam.gz for id in `dl-fasta-list scaffold_11 tid=2348 pos=1943 length=182 number_of_contigs=2 CTTTTTTTTTTTTTTTAATTTTTTAATTTTTTATTTTTATTTTTATTTTGGTTTGTGGAAATTCTTTAGGTTTTAAnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTGTTGTAGACACTGGGGGCTCTAATAAAAGTGACTCACCAACAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA We see that it's plausible for it to be counted as a match, but it is certainly not a great match.