The MovieLens sample dataset used in PoC-in-a-Box workshop includes a few movies without years in their title (e.g. ID 162414 "Moonlight").
If I'm not mistaken (I had tweaked the extraction slightly in my fork), the YEAR field as extracted by regex has some blanks. However, the item metadata schema in notebook 2 tags:
{
"name": "YEAR",
"type": "int"
}
Doesn't it need to be "type": ["int", "null"] for this field to be picked up correctly by the model?