
content
Tutorial
-
1. Click species of interest in classification system that consist hierarchical structure by NCBI taxonomy tree
-
2. User can search species of interest directly by using ‘search panel’ and ‘search panel’ supports auto completion of species name for user-convenience.
-
3. Below the figure, it is example for searching ‘Saccharomyces cerevisiae’. In case by search function, you would show all species to include the typing word.
-
4. Representative genome information for ‘Saccharomyces cerevisiae’ are presented with assembly, taxonomy and GenBank information.
-
5. Users can access detailed information of genome by clicking ‘Genome Browser’ button.
-
6. In ‘Genome Browser’, structural information of genome is presented by circular form with zoom in/out functions ranging from 100% to 1,000%.
-
7. Version of genome could be chosen by database source such as NCBI-Refseq, Ensembl and others. In some case, it can’t show genome because the data is not supported by the database source.
-
8. To download sequence files, users click three icons of ‘Genome’, ‘CDS’ and ‘PEP’.
-
9. A gene search function by position or gene name. Type a gene name in entry of ‘Query’ and then click button of ‘Search’. The gene name is completed automatically if it exist in genome.
-
10. The page indicates to flaking region of gene for users’ search. Additionally, if users want to investigate detailed information for the gene, you could select the gene (“SAF1”). Then, it moves a page of “Gene Viewer”.
-
11. ‘Gene Viewer’ provides detailed information of genes.
-
12. In ‘Gene Information’ panel, locus tag, species name, annotation version and taxonomy information are presented.
-
13. In ‘Gene Structure’ panel, general information about gene structures are provided by various tracks such as gene, mRNA, exon, CDS, ncRNA, repeat region, tRNA and rRNA. Also, you download the sequence of peptide and CDS.
-
14. In ‘Domain Architecture’ panel, domain architectures of genes are provided using IPR terms from pre-performed InterProScan v5.0. The domain information of individual databases such as Pfam, Gene3D, ProSiteProfiles and PANTHER are shown.
-
15. In ‘Subcellular localization’ panel, subcellular localization information predicted from three analysis programs such as TMHMM, MultiLoc2, and TargetP are shown. Detailed information is presented in each subpanels.
-
16. In ‘Ortholog or Paralog information’ panel, ortholog genes of closed-relative species and paralog genes are presented from OrthoMCL analysis. Users can obtain the information of organism, assembly ID and protein ID for candidate orthologs.
-
Step 1. Select a gene numbers to identify.
1. Fill out numbers of genes to identify using Gene Search in entry of ‘No. of subtypes’ and click a submit button.
2. If you type ‘3’ in No. of subtypes, three lines are created to perform Gene Search.
3. Each gene is distinguished by identifier filled with gene name in entry of ‘subtype’ and this identifier is used for header in sequences of FASTA file.
-
Step 2. Perform Gene Search.
1. Fill out domain architectures of genes of interest in entry of ‘Domains’ using IPR terms of domain. Each IPR term is distinguished by comma.
2. To enhance specificity of Gene Search, users fill out entry of ‘with (include genes containing specific IPR terms)’ or ‘without (exclude genes containing specific IPR terms)’ using IPR terms of family, repeat, and site. Each IPR term is distinguished by comma.
3. Select status of ordering of domains between in order or in any order.
4. Select match type using entry of Perfect match.
If you do not select perfect match, all kinds of genes containing domain architectures filled by users with additional domains are searched.
5. Click a search button to execute Gene Search.
-
Step 3. Result of Gene Search.
1. In result window, statistical report of kingdom-wide distribution of genes of interest including domain architectures is shown.
2. Users can download sequences of CDS or peptides by clicking ‘Peptide Download’ or ‘CDS Download’, respectively.
3. Users can also download the sequence by select ‘ALL’ (include alternative spliced genes) or ‘Representative’ (exclude alternative spliced genes)
4. If you want to download kingdom-specific sequences, click panel of certain kingdom in ‘No. of proteins distribution’. The status of each kingdom is indicated by blue line.
5. Users can investigate the domain architectures of searched genes by clicking ‘Viewer Panel on’.
6. To download the sequence file for each gene, click the button of ‘Peptide’ or ‘CDS’.
7. To investigate detailed information of each gene, users can access ‘Gene Viewer’ page by clicking ‘Gene Viewer’ button.
8. According to bottom of window, you move the result pages to click aside button.
-
1. LAST
1. Load the sample reference and sample query. They are mitochondrial DNA sequences of human and fugu.
2. Select ‘DNA’ in both ‘Reference sequence type’ and ‘Query sequence type’.
3. Loaded data is in format of FASTA, select ‘0’ in ‘Input format’.
4. We like to see the result of colored alignments in html format. Select ‘4’ in ‘Output type’ and ‘html’ in ‘Output extension type’.
5. Select options -c -R01
6. In this example, we do not select options in ‘Maximum expected alignments’, ‘Alignment type’, ‘Mach/Mismatch score matrix’, ‘Seeding schema’, and ‘Maximum initial matches’.
7. Click submit button to execute LAST.
8. Results can be accessed via ‘My Page’. In the ‘Detail View’, information of used options is shown. In the ‘View File’, the result of coloured alignments in html format is in ‘maf_convert’ folder. The ‘many to one alignment’ maf file is in ‘maf_split’ folder and presented as ‘output.split.maf’ file. The ‘one to one alignment’ maf file is in ‘maf_swap’ folder and presented as ‘output.split.swap.split.swap.maf’ file. In the dot plot, reference genome is along the top, and query genome is down the side.
-
2. BLAST
1. Users can perform BLAST analysis by fill out sequence, upload users’ files in local computer (Local File), or users’ files in ‘My Gene’ (Cloud File).
2. Select a blast program and database for blast.
3. Fill out entry of ‘EMAIL’ and ‘TITLE’ and click submit button to execute BLAST.
-
3. Multiple sequence alignment
1. To perform multiple sequence alignment, users select tab of alignment method. Then fill out sequence, upload users’ files in local computer (Local File), or users’ files in ‘My Gene’ (Cloud File).
2. Fill out entry of ‘EMAIL’ and ‘TITLE’ and click submit button to execute Multiple sequence alignment.
-
4. InterPro
1. Users can perform InterProScan v5.0 analysis by fill out sequence, upload users’ files in local computer (Local File), or users’ files in ‘My Gene’ (Cloud File).
2. Fill out entry of ‘EMAIL’ and ‘TITLE’ and click submit button to execute InterProScan.
-
5. Phylogenetic Viewer
1. To investigate phylogenetic tree, users can perform Phylogenetic Viewer by filling out phylogenetic tree information of newick format or selecting newick files in ‘My Gene’ (File)
2. Then select a type of phylogenetic tree and click ‘show’ button.
-
1. Users can upload personal data or downloaded data from Prometheus and analyze them (upper panel). Users can also monitor the progress of analysis in ‘My Genes’ and download the result files from each program via a file menu.
-
2. In the case of data from LAST, the result of coloured alignments in html format is in ‘maf_convert’ folder. The ‘many to one alignment’ maf file is in ‘maf_split’ folder and presented as ‘output.split.maf’ file. The ‘one to one alignment’ maf file is in ‘maf_swap’ folder and presented as ‘output.split.swap.split.swap.maf’ file. In the dot plot, reference genome is along the top, and query genome is down the side.
-
3. In the case of data from InterProScan, the result file is shown in a graphic format and results are downloaded in a tsv file format.
-
4. In the case of data from Clustal Omega, phylogenetic tree data (output.tree) is provided with multiple sequence alignment data (output.clu).
-
1. It is executed upon completion of downloading the KoDS high-speed transmission system
(KoDS v.3.5 as of the date of preparing this document) and the initial login screen will be displayed.
Upon selecting your preferred service system, you may log into the system using your user details.
-
2.The list of user local files is displayed on the left and the list of user data stored in the KOBIC cloud storage is displayed on the right. Select a data file you want to transmit and drag & drop it to start transmitting data. The progress status of data transmission is displayed on the monitoring screen on the bottom along with the total transmission capacity and real-time network resource usage status.