Presentation on theme: "Biological Databases Morten Nielsen BioSys, DTU. Different kinds of data DNA –NCBI GenBankNCBI GenBank –Organism specific databases Protein –UniProt SwissProt."— Presentation transcript:
Data redundancy! Databases have non-biological redundancy This is problematic when training data- driven prediction methods –As you saw for PSSM construction Uniprot has a feature to remove redundancy (90% or 50%). How is this done? This and much more you will find out in the next episode of...
Your consent to our cookies if you continue to use this website.