
42dot_LLM-PLM-1.3B_GGUF

Description

42dot ๋ชจ๋ธ์˜ GGUF ๊ฒฝ๋Ÿ‰ํ™” ๋ชจ๋ธ์„ ๋งŒ๋“ค์–ด ๋’€์Šต๋‹ˆ๋‹ค.

42dot_LLM-PLM-1.3B

ํŒŒ์ผ

๋งํฌ์— ์—ฐ๊ฒฐ ํ•ด๋‘์—ˆ์œผ๋‹ˆ ํ•„์š”ํ•˜์‹  ๋ถ„์€ ํ•˜ํŠธ ์ฃผ๊ณ  ์ฑ™๊ฒจ ๊ฐ€์„ธ์š”. gguf ์›๋ณธ ํŒŒ์ผ

Q4, Q8 ๊ฒฝ๋Ÿ‰ํ™” ํŒŒ์ผ

์ด์™ธ ๋ชจ๋ธ์€ ๊ทผ๋ณธ ์—†์–ด์„œ ์˜ฌ๋ฆด๊นŒ ํ•˜๋‹ค๊ฐ€ ์•ˆ ์˜ฌ๋ฆฌ๋ ค๊ณ  ํ•ฉ๋‹ˆ๋‹ค.

Usage

Original model

์›๋ณธ ๋งํฌ์—์„œ ์‚ฌ์šฉ ๋ฒ•์„ ํ™•์ธํ•˜์„ธ์š”.

Inference sample with Llama.cpp

For simple inference, use a command similar to

./main -m gguf-q4_k_m.gguf --temp 0 --top-k 4 --prompt "who was Joseph Weizenbaum?"
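The same GGUF file can also be loaded from Python via the llama-cpp-python bindings. This is only a rough sketch, with the file name and sampling settings taken from the command above.

from llama_cpp import Llama

# Load the Q4_K_M quantized file (same file as in the command above).
llm = Llama(model_path="gguf-q4_k_m.gguf", n_ctx=2048)

# Near-greedy sampling to mirror --temp 0 --top-k 4.
out = llm("who was Joseph Weizenbaum?", max_tokens=128, temperature=0.0, top_k=4)
print(out["choices"][0]["text"])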

Llama.cpp๋กœ ํ† ํฌ๋‚˜์ด์ง• ์ƒ˜ํ”Œ

To get a list of tokens, use a command similar to

./tokenize -m gguf-q4_k_m.gguf --prompt "who was Joseph Weizenbaum?"
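If you prefer to tokenize from Python, llama-cpp-python exposes tokenize/detokenize on the loaded model. A small sketch, reusing the same GGUF file:

from llama_cpp import Llama

llm = Llama(model_path="gguf-q4_k_m.gguf")

# tokenize() expects bytes and returns a list of token ids.
tokens = llm.tokenize("who was Joseph Weizenbaum?".encode("utf-8"))
print(tokens)

# detokenize() maps the ids back to bytes.
print(llm.detokenize(tokens).decode("utf-8"))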

Embedding sample with Llama.cpp

Text embeddings are calculated with a command similar to

./embedding -m gguf-q4_k_m.gguf --prompt "who was Joseph Weizenbaum?"
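A comparable embedding call through llama-cpp-python, assuming the model is loaded with embedding=True:

from llama_cpp import Llama

# embedding=True enables embedding extraction for this model.
llm = Llama(model_path="gguf-q4_k_m.gguf", embedding=True)

vector = llm.embed("who was Joseph Weizenbaum?")
print(len(vector))   # embedding dimension
print(vector[:5])    # first few values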

License

์›๋ณธ ๋ชจ๋ธ ๋ผ์ด์„ผ์Šค๋Š” Creative Commons Attribution-NonCommercial 4.0 (CC BY-NC 4.0) ์ฐธ๊ณ ํ•˜์„ธ์š”.
