
For many decades, Stipa has been described as a genus with over 300 species common in steppe zones of Eurasia, North Africa, Australia and the Americas 36, 37. In the fossil record Stipa L., or a close relative genus, is known from about 34 Mya of the upper Eocene 34, 35. The so-called single-molecule real-time (SMRT) sequencing provided by Pacific Biosciences (PacBio) 31 and nanopore sequencing by Oxford Nanopore Technologies 32 afford a range of benefits, including exceptionally long-read lengths (20 kb or more), resolving extremely repetitive and GC-rich regions and direct variant phasing 32, 33. Currently, to face these issues, the third-generation sequencing has been applied. In comparison to other kingdoms, plants have very large genomes 13, 25, 26, high ploidy level 27 and the abundance of repetitive sequences 28, 29, 30. Nevertheless, the data at the level of genera may not be processed immediately 24. The project is supposed to be completed in 2023 and it presumes to provide family-level high-quality reference genomes, ideally with chromosome-scale assemblies. The later initiative intends to sequence complete genomes from more than 10,000 plants and protists. Recently, the 1KP project that was aiming to sequence 1,000 green plant transcriptomes 21, 22, 23 has been followed by the 10KP project 24. The advent of second-generation sequencing and the subsequent decreasing of the overall sequencing costs have enabled the determination of whole genome sequences in many non-model plant species 17, 18, 19, 20. In the grass family (Poaceae) the reference assemblies were primarily obtained for crops 11, 12, 13 and model plants 14, 15, 16. green algae 2, 3, bryophytes 4, 5, ferns 6, gymnosperms 7, 8 and angiosperms 9, 10. Since then, many genomes from the plant kingdom have been sequenced, e.g. genome became the first plant genome to be completely sequenced and assembled 1. In the year 2000, the Arabidopsis thaliana L. The assembled nuclear, mitochondrial and chloroplast genomes provide a significant source of genetic data for further works on phylogeny, hybridisation and population studies within Stipa and the grass family Poaceae. capillata, investigate the evolutionary history of the genus, demonstrate structural heteroplasmy of the chloroplast genome and announce for the first time the mitochondrial genome in Stipa. Importantly, here we report 81,224 annotated protein-coding genes, present 77,614 perfect and 58 unique imperfect SSRs, reveal the putative allopolyploid nature of S. A draft genome of about 1004 Mb was obtained with a contig N50 length of 351 kb. Thus, here we present a single-molecule long-read sequencing dataset generated using the Pacific Biosciences Sequel System. Although large-scale molecular data has begun to appear, there is still no complete or draft genome for any Stipa species. Moreover, some species are of economic importance mainly as fodder plants and can be used for soil remediation processes. Many taxa of the genus are dominants in steppe plant communities and can be used for their classification and in studies related to climate change. The Eurasian plant Stipa capillata is the most widespread species within feather grasses.
