See some notes here, this error is logged but should probably be treated as an error: https://github.com/srlearn/datasets/issues/19