Fireworks.ai offers an OpenAI-compatible REST API for querying text models. There are several ways to interact with it
<s>[INST] <<SYS>>Some templates like
{system_prompt}
<</SYS>> {user_message_1} [/INST]
llama-chat
can support multiple chat messages as well. In general, we recommend users use the chat completions API whenever possible to avoid common prompt formatting errors. Even small errors like misplaced whitespace may result in poor model performance.
Here are some examples of calling the chat completions API: