R&D/AI

Llama 2 in Apple Silicon Macbook (3/3)

sunshout 2023. 10. 29. 15:19

This chapter is testing the Llama 2 easily.

I am going to deploy web service using FastAPI and Llama 2.

Run web server

git clone https://github.com/choonho/llama_server.git

cd llama_server
pip3 install llama-cpp-python langchain
pip3 install fastapi uvicorn

In step 2, we created Llama 2 model file, copy to "models/7B/ggml-model-q4_0.bin"

python3 server.py

FastAPI provides easy way to test API.

Click "Try it out"

In the "Request body", ask your question.

가상화, 아파트, 네트워크, 팁, 라우터, Xen, 미완성, 논문, latex, 분양, Eclipse, OVM, 회사, Hadoop, ns, PyQt4, Python, CloudStack, HBase, C,