Input RecordIO-protobuf or CSV with tokenized ids Details Number of topics Instance Choice Both CPU and GPU are okay