Benchmark and identify the best ways to speed up LLM inference.
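As a starting point for benchmarking, a minimal throughput harness can time a generation callable and report tokens/sec. The sketch below is generic: `generate_fn` and `dummy_generate` are hypothetical stand-ins for a real inference call, and the whitespace token count is a rough proxy for the model tokenizer.

```python
import time

def benchmark_generation(generate_fn, prompt, n_runs=3):
    """Time a text-generation callable and return average tokens/sec.

    `generate_fn` is any function taking a prompt string and returning
    generated text (a hypothetical stand-in for a real LLM call).
    """
    rates = []
    for _ in range(n_runs):
        start = time.perf_counter()
        output = generate_fn(prompt)
        elapsed = time.perf_counter() - start
        # Rough token count via whitespace split; a real benchmark
        # should count tokens with the model's own tokenizer.
        n_tokens = len(output.split())
        rates.append(n_tokens / elapsed)
    return sum(rates) / len(rates)

# Usage with a dummy generator standing in for the model:
def dummy_generate(prompt):
    return "token " * 100

avg_tps = benchmark_generation(dummy_generate, "Hello")
```

Swapping in different backends (e.g. plain transformers vs. an optimized server) behind the same `generate_fn` interface makes the comparisons directly comparable.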
Resources
- prompt-engineering-guide: includes Mistral-specific details.
- Coursera prompt-engineering course
- another prompt-engineering course
- For structuring experiments (MLOps): https://github.com/vin136/MLOPS
Fine-tuning LLMs
https://www.youtube.com/playlist?list=PL23FjyM69j92o_j5JFH9sNlbhCx4n0ZYh
Hugging Face blog/references
- generation strategies
- chat-template prompting
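Chat templating turns a list of role/content messages into the single prompt string a chat model was trained on. The sketch below hand-rolls a Mistral-style `[INST]` format purely for illustration; the format string is an approximation, and real code should use `tokenizer.apply_chat_template` from Hugging Face transformers instead.

```python
def apply_chat_template(messages):
    """Render a list of {role, content} dicts into one prompt string,
    roughly following a Mistral-style [INST] chat format.

    Illustrative approximation only: in practice, use
    tokenizer.apply_chat_template so the template matches the model.
    """
    parts = ["<s>"]
    for msg in messages:
        if msg["role"] == "user":
            parts.append(f"[INST] {msg['content']} [/INST]")
        elif msg["role"] == "assistant":
            # Assistant turns are closed with the end-of-sequence token.
            parts.append(f" {msg['content']}</s>")
    return "".join(parts)

prompt = apply_chat_template([
    {"role": "user", "content": "What is 2+2?"},
    {"role": "assistant", "content": "4"},
    {"role": "user", "content": "And 3+3?"},
])
```

Getting this formatting wrong (missing special tokens, wrong turn markers) silently degrades output quality, which is why the library-provided template is preferred.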
End-point detection
Basically, use an ML model to detect the end of a spoken command.
Practical solutions:
- VAD (voice activity detection) to detect the end point: faster-whisper, whisper-live
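To make the end-pointing idea concrete, here is a toy energy-threshold end-pointer: declare end-of-command after N consecutive low-energy frames following speech. This is a simplified stand-in for a trained VAD model (like the ones used by faster-whisper and whisper-live); the threshold and frame counts are illustrative assumptions.

```python
def detect_endpoint(frames, energy_threshold=0.01, silence_frames=30):
    """Return the index of the frame where the utterance is judged to
    have ended, or None if no end point is found.

    `frames` is a list of audio frames (each a list of float samples).
    A frame is "silent" when its mean-squared energy falls below the
    threshold; after `silence_frames` consecutive silent frames that
    follow some speech, we declare end-of-command. A production system
    would replace the energy test with a trained VAD model.
    """
    consecutive_silence = 0
    speech_seen = False
    for i, frame in enumerate(frames):
        energy = sum(s * s for s in frame) / len(frame)
        if energy >= energy_threshold:
            speech_seen = True
            consecutive_silence = 0
        elif speech_seen:
            consecutive_silence += 1
            if consecutive_silence >= silence_frames:
                return i
    return None

# Usage: 10 "loud" frames followed by silence.
frames = [[0.5] * 160] * 10 + [[0.0] * 160] * 40
end_idx = detect_endpoint(frames, silence_frames=30)  # fires at frame 39
```

The `silence_frames` budget is the latency/accuracy knob: a smaller value cuts response latency but risks cutting off pauses mid-command.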
TODO: