There are many datasets that only provide partial data i.e either samples only or other forms like ids in twitter datasets.