Vitor C. Piro

Research Interests

Development of methods for bioinformatics, currently with focus on microbial communities, environmental data and metagenomics. I am also interested in many related fields: binning, profiling and assembly of community samples, amplicon and marker gene analysis, phylogeny, taxonomic classification, species delineation, contamination, high performance algorithms, sequence mapping/alignment, antimicrobial resistance, genome assembly and finishing methods.

Software

§ GRIMER automates analysis and reports an offline and interactive dashboard integrating annotation, taxonomy and metadata to analyse microbiome studies and detect contamination.

§ MultiTax is a Python package that provides a common and generalized set of functions to download, parse, filter and explore multiple biological taxonomies (GTDB, NCBI, Silva, Greengenes, Open Tree taxonomy) and custom formatted taxonomies.


§ ganon is a k-mer based short read classification tool which uses Interleaved Bloom Filters in conjunction with a taxonomic clustering and a k-mer counting-filtering scheme. Ganon strength lies in providing an ultra-fast indexing method for very large sets of reference sequences and a high classification precision.


§ TaxSBP is am implementation of the approximation algorithm for the hierarchically structured bin packing problem based on the NCBI Taxonomy database.


§ genome_updater is a portable bash script to download and update files from NCBI genomes, keeping log and version for each update, with file check and parallel download support.


§ MetaMeta is a pipeline to execute and integrate results from metagenome analysis tools. It provides an easy workflow to run multiple tools with multiple samples, producing a single enhanced output profile for each sample.


§ DUDes is a reference-based taxonomic profiler with a top-down approach to analyze metagenomic NGS samples. Instead of using the lowest common ancestor we developed the deepest uncommon descendent.


§ FGAP is an automated gap closing tool. It uses BLAST to align multiple contigs against a draft genome assembly aiming to find sequences that overlap gaps. The algorithm selects the best sequence to fill and eliminate the gap.

Publications




Webpage template from CV Maker | Publication list generated automatically with bibtex-js