WPAIGPT-SQL-01 is a specialized text-to-SQL model designed for WordPress and WordPress plugins. It generates SQL queries based on natural language requests, with a focus on WordPress-specific database structures and popular plugins.
Model Details
Model Description
WPAIGPT-SQL-01 is a fine-tuned version of the Qwen2.5-Coder-7B model, optimized for generating SQL queries for WordPress databases. It can handle queries related to core WordPress tables as well as tables added by various plugins.
- Developed by: WPAI Inc, James LePage
- Funded by: WPAI Inc
- Model type: Text-to-SQL Language Model
- Language(s) (NLP): English
- License: Apache 2.0
- Finetuned from model: Qwen2.5-Coder-7B-Instruct
Uses
Direct Use
The model is designed for direct text-to-SQL generation for WordPress databases. Users can input natural language requests, optionally including plugin names, versions, and table descriptions, to generate SQL queries. This is particularly useful for:
- Retrieving information from WordPress databases
- Adding functionality to existing WordPress plugins by generating SQL queries
- Assisting developers in creating database queries for WordPress projects
Downstream Use
- Integration into WPAI products, primarily AgentWP, for real-time information retrieval from WordPress websites
- Use in code generation tools to create queries for more complete WordPress systems like plugins
- Incorporation into agent pipelines for WordPress-related tasks
Out-of-Scope Use
While there are no strict out-of-scope uses, users should be aware that as a Transformer-based model, it can potentially hallucinate or generate incorrect queries. All generated SQL should be verified before execution against a live WordPress database.
Bias, Risks, and Limitations
- The model may be biased towards more popular WordPress plugins and those with more extensive database interactions.
- There's a bias towards SELECT and read-only operations over database-modifying queries.
- The model's knowledge is limited to the training data, which may not cover all possible WordPress plugins or database structures.
- As with any language model, there's a risk of generating syntactically correct but logically incorrect or potentially harmful SQL queries.
Recommendations
- Always verify and test generated SQL queries before executing them on a live WordPress database.
- Use in conjunction with proper access controls and user authentication to prevent unauthorized database access.
- Regularly update the model to include knowledge of new WordPress versions and popular plugins.
- Implement additional safety checks and validations when using the model in automated systems.
Training Details
Training Data
The training data consists of hundreds of thousands of instruction-to-SQL examples, structured as follows:
- 25% include described tables that WordPress plugins may add, along with plugin name, version, and instruction
- 25% include only the plugin name, version, and instruction
- 50% include only the instruction
The queries are derived from popular WordPress plugins, both from the official WordPress repository and premium plugins. The data generation process involves:
- Indexing plugin codebases
- Extracting code that manipulates the WordPress database
- Synthetically generating SQL queries
- Verifying queries by running them against a WordPress installation with the plugin installed
There's a bias towards the most popular WordPress plugins and those with significant database interactions. Additional manual data has been included for specific plugins like WooCommerce, LearnDash, and Gravity Forms.
Training Procedure
The training procedure details are available in the provided Python notebook. For specific information about hyperparameters, preprocessing steps, and other training details, please refer to the notebook.
Evaluation
Testing Data, Factors & Metrics
Formal evaluations have not been conducted. The model's performance is primarily assessed through:
- A/B testing in WPAI products
- User rankings on end systems (AgentWP, CodeWP, and other WPAI products)
Technical Specifications
Model Architecture and Objective
The model is based on the Qwen2.5-Coder-7B architecture, fine-tuned for the specific task of WordPress SQL generation. It uses a causal language modeling objective to generate SQL queries based on natural language inputs.
Key features of the base Qwen2.5-Coder-7B model include:
- Number of Parameters: 7.61B
- Number of Layers: 28
- Number of Attention Heads: 28 for Q and 4 for KV (using Grouped-Query Attention)
- Context Length: Full 131,072 tokens (with the ability to handle long contexts using YaRN technique)
The model has been specifically fine-tuned to understand WordPress database structures and generate appropriate SQL queries, maintaining its coding capabilities while focusing on the WordPress ecosystem.
- Downloads last month
- 4