Machine learning on biological datasets