There are many cases to engineering system prompt (e.g., JSON mode) but mlc-serve currently does not support it.
Although it is not a problem in prod since ollm handles such processing, it would be nice to add support at mlc-serve level for the sake of better debugging experience.