werks on top of llama.cpp commit c47cf414efafb8f60596edc7edb5a2d68065e992