Latent Dirichlet Allocation

#aws #sagemaker #natural language processing #topic modeling

Input

RecordIO-protobuf or CSV with tokenized ids id:count

Details

  • Number of topics

Hyperparameters

  • Alpha
    • the concentration parameter: small value -> sparse topic mixtures

Instance Choice

Only CPU