--- library_name: peft base_model: codellama/CodeLlama-7b-hf --- # Model Card for Model ID 4 bit general purpose text-to-SQL model. Takes 5677MiB of GPU memory. ## Model Details ### Model Description Provide the CREATE statement of the target table(s) in the context of your prompt and ask a question to your database. The model outputs a query to answer the question. Data used for fine tuning: https://huggingface.co/datasets/b-mc2/sql-create-context - **Developed by:** [More Information Needed] - **Model type:** [More Information Needed] - **Language(s) (NLP):** [More Information Needed] - **License:** [More Information Needed] - **Finetuned from model [optional]:** codellama/CodeLlama-7b-hf ## Uses This model can be coupled with a chat model like llama2-chat to convert the output into a text answer. ### Direct Use ```python from peft import AutoPeftModelForCausalLM from transformers import AutoTokenizer, AutoModelForCausalLM import torch # load model base_model = "GTimothee/sql-code-llama-4bits" model = AutoModelForCausalLM.from_pretrained( base_model, load_in_4bit=True, torch_dtype=torch.float16, device_map="auto", ) model.eval() # load tokenizer tokenizer = AutoTokenizer.from_pretrained("codellama/CodeLlama-7b-hf") eval_prompt = """You are a powerful text-to-SQL model. Your job is to answer questions about a database. You are given a question and context regarding one or more tables. You must output the SQL query that answers the question. ### Input: Which Class has a Frequency MHz larger than 91.5, and a City of license of hyannis, nebraska? ### Context: CREATE TABLE table_name_12 (class VARCHAR, frequency_mhz VARCHAR, city_of_license VARCHAR) ### Response: """ model_input = tokenizer(eval_prompt, return_tensors="pt").to("cuda") with torch.no_grad(): print(tokenizer.decode(model.generate(**model_input, max_new_tokens=100)[0], skip_special_tokens=True)) ``` Outputs: ```python ### Response: SELECT class FROM table_name_12 WHERE frequency_mhz > 91.5 AND city_of_license = "hyannis, nebraska" ``` ## Bias, Risks, and Limitations - potential security issues if there is a malicious use. If you execute blindly the SQL queries that are being generated by end users you could lose data, leak information etc. - may be mistaken depending on the way the prompt has been written. ### Recommendations - Make sure that you check the generated SQL before applying it if the model is used by end users directly. - The model works well when used on simple tables and simple queries. If possible, try to break a complex query into multiple simple queries.