Description of project

When scientists collect data, the datasets they acquire often overlap. Storing and processing this duplicated data wastes storage space and computing resources. In this project we will focus on techniques that help scientists track and identify duplicated data so that it can be dealt with. We will do this by developing MinHash algorithms, which we hope will let scientists find duplicated data without spending storage and computing resources on it.
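To give a rough idea of how MinHash can flag overlapping data, here is a minimal Python sketch. It is only an illustration and not the project's implementation: the function names (shingles, minhash_signature, estimate_jaccard, exact_jaccard), the character shingle size of 5, the 256 hash functions, and the use of seeded BLAKE2b hashes are all assumptions made for this example. The idea is that two sets with a large overlap will share many of their minimum hash values, so the fraction of matching signature positions estimates their Jaccard similarity.

```python
import hashlib


def shingles(text, k=5):
    """Return the set of overlapping k-character substrings of text."""
    if len(text) < k:
        return {text}
    return {text[i:i + k] for i in range(len(text) - k + 1)}


def _hash(item, seed):
    """Hash a string to a 64-bit integer; the seed selects the hash function."""
    data = f"{seed}:{item}".encode("utf-8")
    return int.from_bytes(hashlib.blake2b(data, digest_size=8).digest(), "big")


def minhash_signature(items, num_hashes=256):
    """Keep the smallest hash value of the set under each seeded hash function."""
    if not items:
        raise ValueError("cannot build a MinHash signature for an empty set")
    return [min(_hash(x, seed) for x in items) for seed in range(num_hashes)]


def estimate_jaccard(sig_a, sig_b):
    """Estimate Jaccard similarity as the fraction of matching positions."""
    if len(sig_a) != len(sig_b):
        raise ValueError("signatures must have the same length")
    matches = sum(a == b for a, b in zip(sig_a, sig_b))
    return matches / len(sig_a)


def exact_jaccard(a, b):
    """Exact Jaccard similarity of two sets, for comparison with the estimate."""
    return len(a & b) / len(a | b)


# Example: two records that differ only in the last word.
record_a = shingles("the quick brown fox jumps over the lazy dog")
record_b = shingles("the quick brown fox jumps over the lazy cat")
sig_a = minhash_signature(record_a)
sig_b = minhash_signature(record_b)
print("estimated:", estimate_jaccard(sig_a, sig_b))
print("exact:    ", exact_jaccard(record_a, record_b))
```

The signatures have a fixed length no matter how large the original data is, so in a setting like ours they could be stored and compared instead of the full datasets. More hash functions give a more accurate estimate at the cost of a longer signature.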
