Skip to main content

Future plan

  • prompt extensions
  • more platforms and cross compile (performance related)
  • tweak embedding API, make end token configurable
  • cli and interactive
  • GPU inference