3.1 Full-length transcription sequences
Brain and thoracic ganglia samples of sexually mature giant river prawns were used for single molecular real time (SMART) sequencing. After removing adaptor sequences and low-quality sequences, a total of 73,146,314 subreads (159.1 Gb) were obtained, and the average length of subread was 2,175 bp (Figure 1A). Subread sequences were subjected to self-correction to produce 1,725,793 circular consensus sequence reads (CCS), and the CCS sequences were clustered into 1,509,283 full-length non-chimerics (FLNC). The FLNCs were polished to obtain 99,728 highly quality isoform numbers (288 Mb), with an average length of 2,891 bp (Figure 1B). Finally, 84,627 unigene sequences (246 Mb) with an average read length of 2,913 bp (Figure 1C), were obtained using high quality isoform sequence clustering (identify = 98%). 78,559 transcriptions were found between 1 and 10 isoforms (Figure 1D). The transcript length extended from 170 bp to 14,287 bp, with an average length of 3,061 bp.