3.1 Full-length transcription sequences
Brain and thoracic ganglia samples of sexually mature giant river prawns
were used for single molecular real time (SMART) sequencing. After
removing adaptor sequences and low-quality sequences, a total of
73,146,314 subreads (159.1 Gb) were obtained, and the average length of
subread was 2,175 bp (Figure 1A). Subread sequences were subjected to
self-correction to produce 1,725,793 circular consensus sequence reads
(CCS), and the CCS sequences were clustered into 1,509,283 full-length
non-chimerics (FLNC). The FLNCs were polished to obtain 99,728 highly
quality isoform numbers (288 Mb), with an average length of 2,891 bp
(Figure 1B). Finally, 84,627 unigene sequences (246 Mb) with an average
read length of 2,913 bp (Figure 1C), were obtained using high quality
isoform sequence clustering (identify = 98%). 78,559 transcriptions
were found between 1 and 10 isoforms (Figure 1D). The transcript length
extended from 170 bp to 14,287 bp, with an average length of 3,061 bp.