Skip to main content

Computational pipelines for the analysis of plastic-degrading genes

Submission Number: 180
Submission ID: 3985
Submission UUID: 399fee51-c671-4acd-8572-060c448fd732
Submission URI: /form/project

Created: Sat, 08/26/2023 - 06:41
Completed: Sat, 08/26/2023 - 06:44
Changed: Thu, 06/13/2024 - 14:36

Remote IP address: 104.28.39.76
Submitted by: Gaurav Khanna
Language: English

Is draft: No
Webform: Project
Computational pipelines for the analysis of plastic-degrading genes
CAREERS
anthrax-image-protein-units-drug-factor-unit.jpg
bioinformatics (277), biology (515), workflow (365), workforce-development (337)
Complete

Project Leader

Ying Zhang
{Empty}
{Empty}

Project Personnel

{Empty}
Aidan McCrillis
{Empty}

Project Information

The prevalence of microplastics is posing increasing problems to the environment and human health. Despite the identification of several plastic-degrading pathways, their presence and evolution among diverse microorganisms is largely unexplored.

In this project, we will develop a computational pipeline using the Snakemake workflow management system to assemble bioinformatics tools for the identification and evolutionary analysis of plastic-degrading proteins. We will also examine the abundance of these proteins by mining metagenomic data. Snakemake workflow management system is a python-based tool to create reproducible and scalable data analyses. The supported student will work with PI and other lab researchers to understand the analysis workflow and bioinformatics tools and build a computational pipeline using Snakemake. This effort will include elements of HPC, bioinformatics, python programming, and git version controls.

Project Information Subsection

{Empty}
{Empty}
{Empty}
{Empty}
{Empty}
{Empty}
University of Rhode Island
{Empty}
CR-University of Rhode Island
{Empty}
No
Already behind3Start date is flexible
6
{Empty}
{Empty}
  • Milestone Title: Milestone #1
    Milestone Description: Determine project scope, Snakemake work environment, launch presentation.
    Completion Date Goal: 2023-10-30
  • Milestone Title: Milestone #2
    Milestone Description: Bioinformatics tool assembly and testing, finalize pipeline design.
    Completion Date Goal: 2023-11-30
  • Milestone Title: Milestone #3
    Milestone Description: Complete working prototype, project git
    Completion Date Goal: 2023-12-30
  • Milestone Title: Milestone #4
    Milestone Description: Extensive testing and debugging, application and data analysis.
    Completion Date Goal: 2024-02-28
  • Milestone Title: Milestone #5
    Milestone Description: Wrap up development, update project git and documentation, wrap presentation
    Completion Date Goal: 2024-03-30
{Empty}
{Empty}
{Empty}
{Empty}
{Empty}
{Empty}
{Empty}

Final Report

This project will help to streamline the identification of potential plastic degrading genes and proteins and help to create reproducible results that can easily be built upon by other researchers.
This probably won't have a large impact on other disciplines but will allow people outside bioinformatics to use this pipeline without having a comprehensive knowledge of the packages and modules that were used in the pipeline.
{Empty}
{Empty}
{Empty}
{Empty}
{Empty}
This project will help to further knowledge about plastic degrading proteins with the aim of helping to solve the problem of plastic pollution and finding a way to degrade the plastics that are in the ocean.
I think I gained a lot of valuable knowledge in the process of building a workflow and what considerations should be taken to make a product that is usable for other people. I also gained a lot of experience with dealing with large sets of bioinformatics data and how to make sense of such a large dataset.
The overall result of the project I worked on is a workflow that allows for the identification and sorting of potential plastic degrading genes.