PyPI - vanna - Versions diffs - 0.0.6__tar.gz → 0.0.11__tar.gz - Mend

vanna 0.0.6tar.gz → 0.0.11tar.gz

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (15) hide show

vanna-0.0.11/PKG-INFO +156 -0
vanna-0.0.11/README.md +142 -0
{vanna-0.0.6 → vanna-0.0.11}/pyproject.toml +2 -2
{vanna-0.0.6 → vanna-0.0.11}/src/vanna/__init__.py +326 -73
{vanna-0.0.6 → vanna-0.0.11}/src/vanna/types.py +26 -3
vanna-0.0.11/src/vanna.egg-info/PKG-INFO +156 -0
{vanna-0.0.6 → vanna-0.0.11}/src/vanna.egg-info/requires.txt +1 -0
vanna-0.0.6/PKG-INFO +0 -20
vanna-0.0.6/README.md +0 -6
vanna-0.0.6/src/vanna.egg-info/PKG-INFO +0 -20
{vanna-0.0.6 → vanna-0.0.11}/LICENSE +0 -0
{vanna-0.0.6 → vanna-0.0.11}/setup.cfg +0 -0
{vanna-0.0.6 → vanna-0.0.11}/src/vanna.egg-info/SOURCES.txt +0 -0
{vanna-0.0.6 → vanna-0.0.11}/src/vanna.egg-info/dependency_links.txt +0 -0
{vanna-0.0.6 → vanna-0.0.11}/src/vanna.egg-info/top_level.txt +0 -0

vanna-0.0.11/PKG-INFO ADDED Viewed

@@ -0,0 +1,156 @@
+Metadata-Version: 2.1
+Name: vanna
+Version: 0.0.11
+Summary: Generate SQL queries from natural language
+Author-email: Zain Hoda <zain@vanna.ai>
+Project-URL: Homepage, https://github.com/vanna-ai/vanna-py
+Project-URL: Bug Tracker, https://github.com/vanna-ai/vanna-py/issues
+Classifier: Programming Language :: Python :: 3
+Classifier: License :: OSI Approved :: MIT License
+Classifier: Operating System :: OS Independent
+Requires-Python: >=3.7
+Description-Content-Type: text/markdown
+License-File: LICENSE
+| GitHub | PyPI | Colab | Documentation |
+| ------ | ---- | ----- | ------------- |
+| [![GitHub](https://img.shields.io/badge/GitHub-vanna--py-blue?logo=github)](https://github.com/vanna-ai/vanna-py) | [![PyPI](https://img.shields.io/pypi/v/vanna?logo=pypi)](https://pypi.org/project/vanna/) | [![Colab](https://img.shields.io/badge/Colab-vanna--py-blue?logo=google-colab)](https://colab.research.google.com/github/vanna-ai/vanna-py/blob/main/notebooks/vn-starter.ipynb) | [![Documentation](https://img.shields.io/badge/Documentation-vanna--py-blue?logo=read-the-docs)](https://docs.vanna.ai) |
+# Vanna.AI
+Vanna is a Python-based AI SQL co-pilot. Our initial users are data-savvy data analysts, data scientists, engineers, and similar people that use Vanna to automate writing complex SQL.
+Vanna can:
+- [Convert natural language to SQL](#natural-language-to-sql)
+- [Run SQL](#run-sql)
+- [Generate Plotly code](#generate-plotly-code)
+- [Run Plotly code](#run-plotly-code)
+- [Get better over time](#improve-your-training-data)
+- Be used in a Jupyter Notebooks, Colab, or other Python environments
+- Be used with Snowflake, BigQuery, and other databases
+- Be used with Python UIs, such as [Streamlit](https://github.com/vanna-ai/vanna-streamlit), Dash, and others
+- Be used to make Slack bots
+## Natural Language to SQL
+```python
+sql = vn.generate_sql(question='Who are the top 10 customers?')
+```
+### Output:
+```sql
+SELECT customer_name,
+       total_sales
+FROM   (SELECT c.c_name as customer_name,
+               sum(l.l_extendedprice * (1 - l.l_discount)) as total_sales,
+               row_number() OVER (ORDER BY sum(l.l_extendedprice * (1 - l.l_discount)) desc) as rank
+        FROM   snowflake_sample_data.tpch_sf1.lineitem l join snowflake_sample_data.tpch_sf1.orders o
+                ON l.l_orderkey = o.o_orderkey join snowflake_sample_data.tpch_sf1.customer c
+                ON o.o_custkey = c.c_custkey
+        GROUP BY customer_name)
+WHERE  rank <= 10;
+```
+## Run SQL
+This function is provided as a convenience. You can choose to run your SQL however you normally do and use the rest of the downstream functions.
+```python
+df = vn.get_results(cs, database, sql)
+```
+### Output:
+| customer_name | total_sales |
+| ------------- | ----------- |
+| Customer#000000001 |  68127.72 |
+| Customer#000000002 |  65898.69 |
+...
+## Generate Plotly Code
+```python
+plotly_code = vn.generate_plotly_code(question=my_question, sql=sql, df=df)
+```
+### Output:
+```python
+fig = go.Figure(go.Bar(
+    x=df['CUSTOMER_NAME'],
+    y=df['TOTAL_SALES'],
+    marker={'color': df['TOTAL_SALES'], 'colorscale': 'Viridis'},
+    text=df['TOTAL_SALES'],
+    textposition='auto',
+))
+fig.update_layout(
+    title="Top 10 Customers by Sales",
+    xaxis_title="Customer",
+    yaxis_title="Total Sales",
+    xaxis_tickangle=-45,
+    yaxis_tickprefix="$",
+)
+```
+## Run Plotly Code
+```python
+fig = vn.get_plotly_figure(plotly_code=plotly_code, df=df)
+fig.show()
+```
+### Output:
+![Top 10 Customers by Sales](docs/chart.png)
+## Improve Your Training Data
+```python
+vn.store_sql(
+    question=my_question,
+    sql=sql,
+)
+```
+## How Vanna Works
+```mermaid
+flowchart LR
+    DB[(Known Correct Question-SQL)]
+    Try[Try to Use DDL/Documentation]
+    SQL(SQL)
+    Check{Is the SQL correct?}
+    Generate[fa:fa-circle-question Use Examples to Generate]
+    DB --> Find
+    Question[fa:fa-circle-question Question] --> Find{fa:fa-magnifying-glass Do we have similar questions?}
+    Find -- Yes --> Generate
+    Find -- No --> Try
+    Generate --> SQL
+    Try --> SQL
+    SQL --> Check
+    Check -- Yes --> DB
+    Check -- No --> Analyst[fa:fa-glasses Analyst Writes the SQL]
+    Analyst -- Adds --> DB
+```
+# Getting Started
+## Install Vanna from PyPI and import it:
+```python
+%pip install vanna
+import vanna as vn
+```
+## Enter your email to set an API Key
+This will send a one-time code to your email address. Copy and paste the code into the prompt.
+```python
+my_email = '' # Enter your email here
+vn.login(email=my_email)
+```
+## Add Training Data
+```python
+vn.train(
+    question="Which products have the highest sales?",
+    sql="...",
+)
+```
+## Generate SQL
+```python
+sql = vn.generate_sql(question="Who are the top 10 customers?")
+```
+# Documentation
+[Full Documentation](https://docs.vanna.ai)

vanna-0.0.11/README.md ADDED Viewed

@@ -0,0 +1,142 @@
+| GitHub | PyPI | Colab | Documentation |
+| ------ | ---- | ----- | ------------- |
+| [![GitHub](https://img.shields.io/badge/GitHub-vanna--py-blue?logo=github)](https://github.com/vanna-ai/vanna-py) | [![PyPI](https://img.shields.io/pypi/v/vanna?logo=pypi)](https://pypi.org/project/vanna/) | [![Colab](https://img.shields.io/badge/Colab-vanna--py-blue?logo=google-colab)](https://colab.research.google.com/github/vanna-ai/vanna-py/blob/main/notebooks/vn-starter.ipynb) | [![Documentation](https://img.shields.io/badge/Documentation-vanna--py-blue?logo=read-the-docs)](https://docs.vanna.ai) |
+# Vanna.AI
+Vanna is a Python-based AI SQL co-pilot. Our initial users are data-savvy data analysts, data scientists, engineers, and similar people that use Vanna to automate writing complex SQL.
+Vanna can:
+- [Convert natural language to SQL](#natural-language-to-sql)
+- [Run SQL](#run-sql)
+- [Generate Plotly code](#generate-plotly-code)
+- [Run Plotly code](#run-plotly-code)
+- [Get better over time](#improve-your-training-data)
+- Be used in a Jupyter Notebooks, Colab, or other Python environments
+- Be used with Snowflake, BigQuery, and other databases
+- Be used with Python UIs, such as [Streamlit](https://github.com/vanna-ai/vanna-streamlit), Dash, and others
+- Be used to make Slack bots
+## Natural Language to SQL
+```python
+sql = vn.generate_sql(question='Who are the top 10 customers?')
+```
+### Output:
+```sql
+SELECT customer_name,
+       total_sales
+FROM   (SELECT c.c_name as customer_name,
+               sum(l.l_extendedprice * (1 - l.l_discount)) as total_sales,
+               row_number() OVER (ORDER BY sum(l.l_extendedprice * (1 - l.l_discount)) desc) as rank
+        FROM   snowflake_sample_data.tpch_sf1.lineitem l join snowflake_sample_data.tpch_sf1.orders o
+                ON l.l_orderkey = o.o_orderkey join snowflake_sample_data.tpch_sf1.customer c
+                ON o.o_custkey = c.c_custkey
+        GROUP BY customer_name)
+WHERE  rank <= 10;
+```
+## Run SQL
+This function is provided as a convenience. You can choose to run your SQL however you normally do and use the rest of the downstream functions.
+```python
+df = vn.get_results(cs, database, sql)
+```
+### Output:
+| customer_name | total_sales |
+| ------------- | ----------- |
+| Customer#000000001 |  68127.72 |
+| Customer#000000002 |  65898.69 |
+...
+## Generate Plotly Code
+```python
+plotly_code = vn.generate_plotly_code(question=my_question, sql=sql, df=df)
+```
+### Output:
+```python
+fig = go.Figure(go.Bar(
+    x=df['CUSTOMER_NAME'],
+    y=df['TOTAL_SALES'],
+    marker={'color': df['TOTAL_SALES'], 'colorscale': 'Viridis'},
+    text=df['TOTAL_SALES'],
+    textposition='auto',
+))
+fig.update_layout(
+    title="Top 10 Customers by Sales",
+    xaxis_title="Customer",
+    yaxis_title="Total Sales",
+    xaxis_tickangle=-45,
+    yaxis_tickprefix="$",
+)
+```
+## Run Plotly Code
+```python
+fig = vn.get_plotly_figure(plotly_code=plotly_code, df=df)
+fig.show()
+```
+### Output:
+![Top 10 Customers by Sales](docs/chart.png)
+## Improve Your Training Data
+```python
+vn.store_sql(
+    question=my_question,
+    sql=sql,
+)
+```
+## How Vanna Works
+```mermaid
+flowchart LR
+    DB[(Known Correct Question-SQL)]
+    Try[Try to Use DDL/Documentation]
+    SQL(SQL)
+    Check{Is the SQL correct?}
+    Generate[fa:fa-circle-question Use Examples to Generate]
+    DB --> Find
+    Question[fa:fa-circle-question Question] --> Find{fa:fa-magnifying-glass Do we have similar questions?}
+    Find -- Yes --> Generate
+    Find -- No --> Try
+    Generate --> SQL
+    Try --> SQL
+    SQL --> Check
+    Check -- Yes --> DB
+    Check -- No --> Analyst[fa:fa-glasses Analyst Writes the SQL]
+    Analyst -- Adds --> DB
+```
+# Getting Started
+## Install Vanna from PyPI and import it:
+```python
+%pip install vanna
+import vanna as vn
+```
+## Enter your email to set an API Key
+This will send a one-time code to your email address. Copy and paste the code into the prompt.
+```python
+my_email = '' # Enter your email here
+vn.login(email=my_email)
+```
+## Add Training Data
+```python
+vn.train(
+    question="Which products have the highest sales?",
+    sql="...",
+)
+```
+## Generate SQL
+```python
+sql = vn.generate_sql(question="Who are the top 10 customers?")
+```
+# Documentation
+[Full Documentation](https://docs.vanna.ai)

{vanna-0.0.6 → vanna-0.0.11}/pyproject.toml RENAMED Viewed

@@ -1,6 +1,6 @@
 [project]
 name = "vanna"
-version = "0.0.6"
+version = "0.0.11"
 authors = [
   { name="Zain Hoda", email="zain@vanna.ai" },
 ]
@@ -13,7 +13,7 @@ classifiers = [
     "Operating System :: OS Independent",
 ]
 dependencies = [
-    "requests", "tabulate", "plotly"
+    "requests", "tabulate", "plotly", "pandas"
 ]
 [project.urls]

{vanna-0.0.6 → vanna-0.0.11}/src/vanna/__init__.py RENAMED Viewed

@@ -4,7 +4,6 @@ Vanna.AI is a platform that allows you to ask questions about your data in plain
 # API Reference
 '''
-print("Vanna.AI Imported")
 import requests
 import pandas as pd
@@ -13,16 +12,41 @@ import dataclasses
 import plotly
 import plotly.express as px
 import plotly.graph_objects as go
-from .types import SQLAnswer, Explanation, QuestionSQLPair, Question, QuestionId, DataResult, PlotlyResult, Status, FullQuestionDocument, QuestionList, QuestionCategory, AccuracyStats, UserEmail, UserOTP, ApiKey, OrganizationList, Organization, NewOrganization
-from typing import List, Dict, Any, Union, Optional
+from .types import SQLAnswer, Explanation, QuestionSQLPair, Question, QuestionId, DataResult, PlotlyResult, Status, FullQuestionDocument, QuestionList, QuestionCategory, AccuracyStats, UserEmail, UserOTP, ApiKey, OrganizationList, Organization, NewOrganization, StringData, QuestionStringList, Visibility, NewOrganizationMember, DataFrameJSON
+from typing import List, Dict, Any, Union, Optional, Callable, Tuple
+import warnings
+import traceback
-"""Set the API key for Vanna.AI."""
 api_key: Union[str, None] = None # API key for Vanna.AI
+"""
+## Example
+```python
+# Login to Vanna.AI
+vn.login('user@example.com')
+print(vn.api_key)
+vn.api_key='my_api_key'
+```
+This is the API key for Vanna.AI. You can set it manually if you have it or use [`vn.login(...)`][vanna.login] to login and set it automatically.
+"""
+sql_to_df: Union[Callable[[str], pd.DataFrame], None] = None # Function to convert SQL to a Pandas DataFrame
+"""
+## Example
+```python
+vn.sql_to_df = lambda sql: pd.read_sql(sql, engine)
+```
+Set the SQL to DataFrame function for Vanna.AI. This is used in the [`vn.ask(...)`][vanna.ask] function.
+"""
 __org: Union[str, None] = None # Organization name for Vanna.AI
-_endpoint = "https://ask.vanna.ai/rpc"
-_unauthenticated_endpoint = "https://ask.vanna.ai/unauthenticated_rpc"
+_endpoint = "https://vanna-rpc-test-x5y3argz6q-uc.a.run.app/rpc"
+_unauthenticated_endpoint = "https://vanna-rpc-test-x5y3argz6q-uc.a.run.app/unauthenticated_rpc"
 def __unauthenticated_rpc_call(method, params):
     headers = {
@@ -41,10 +65,10 @@ def __rpc_call(method, params):
     global __org
     if api_key is None:
-        raise Exception("API key not set")
+        raise Exception("API key not set. Use vn.login(...) to login.")
     if __org is None and method != "list_orgs":
-        raise Exception("Organization name not set")
+        raise Exception("Datasets not set. Use vn.use_datasets([...]) to set the datasets to use.")
     if method != "list_orgs":
         headers = {
@@ -56,7 +80,7 @@ def __rpc_call(method, params):
         headers = {
             'Content-Type': 'application/json',
             'Vanna-Key': api_key,
-            'Vanna-Org': 'demo-sales'
+            'Vanna-Org': 'demo-tpc-h'
         }
     data = {
@@ -101,9 +125,9 @@ def login(email: str, otp_code: Union[str, None] = None) -> bool:
         if not status.success:
             return False
-        otp = input("Check your email for the code and enter it here: ")
+        otp_code = input("Check your email for the code and enter it here: ")
-    params = [UserOTP(email=email, otp=otp)]
+    params = [UserOTP(email=email, otp=otp_code)]
     d = __unauthenticated_rpc_call(method="verify_otp", params=params)
@@ -120,17 +144,17 @@ def login(email: str, otp_code: Union[str, None] = None) -> bool:
     return True
-def list_orgs() -> List[str]:
+def list_datasets() -> List[str]:
     """
     ## Example
     ```python
-    orgs = vn.list_orgs()
+    datasets = vn.list_datasets()
     ```
-    List the organizations that the user is a member of.
+    List the datasets that the user is a member of.
     Returns:
-        List[str]: A list of organization names.
+        List[str]: A list of dataset names.
     """
     d = __rpc_call(method="list_orgs", params=[])
@@ -141,23 +165,23 @@ def list_orgs() -> List[str]:
     return orgs.organizations
-def create_org(org: str, db_type: str) -> bool:
+def create_dataset(dataset: str, db_type: str) -> bool:
     """
     ## Example
     ```python
-    vn.create_org(org="my-org", db_type="postgres")
+    vn.create_dataset(dataset="my-dataset", db_type="postgres")
     ```
-    Create a new organization.
+    Create a new dataset.
     Args:
-        org (str): The name of the organization to create.
-        db_type (str): The type of database to use for the organization. This can be "Snowflake", "BigQuery", "Postgres", or anything else.
+        dataset (str): The name of the dataset to create.
+        db_type (str): The type of database to use for the dataset. This can be "Snowflake", "BigQuery", "Postgres", or anything else.
     Returns:
-        bool: True if the organization was created successfully, False otherwise.
+        bool: True if the dataset was created successfully, False otherwise.
     """
-    params = [NewOrganization(org_name=org, db_type=db_type)]
+    params = [NewOrganization(org_name=dataset, db_type=db_type)]
     d = __rpc_call(method="create_org", params=params)
@@ -168,46 +192,115 @@ def create_org(org: str, db_type: str) -> bool:
     return status.success
+def add_user_to_dataset(dataset: str, email: str, is_admin: bool) -> bool:
+    """
+    ## Example
+    ```python
+    vn.add_user_to_dataset(dataset="my-dataset", email="user@example.com")
+    ```
+    Add a user to an dataset.
-def set_org(org: str) -> None:
+    Args:
+        dataset (str): The name of the dataset to add the user to.
+        email (str): The email address of the user to add.
+        is_admin (bool): Whether or not the user should be an admin.
+    Returns:
+        bool: True if the user was added successfully, False otherwise.
+    """
+    params = [NewOrganizationMember(org_name=dataset, email=email, is_admin=is_admin)]
+    d = __rpc_call(method="add_user_to_org", params=params)
+    if 'result' not in d:
+        return False
+    status = Status(**d['result'])
+    if not status.success:
+        print(status.message)
+    return status.success
+def set_dataset_visibility(visibility: bool) -> bool:
     """
     ## Example
     ```python
-    vn.set_org("my-org")
+    vn.set_dataset_visibility(visibility=True)
     ```
-    Set the organization name for the Vanna.AI API.
+    Set the visibility of the current dataset. If a dataset is visible, anyone can see it. If it is not visible, only members of the dataset can see it.
     Args:
-        org (str): The organization name.
+        visibility (bool): Whether or not the dataset should be publicly visible.
+    Returns:
+        bool: True if the dataset visibility was set successfully, False otherwise.
     """
+    params = [Visibility(visibility=visibility)]
+    d = __rpc_call(method="set_org_visibility", params=params)
+    if 'result' not in d:
+        return False
+    status = Status(**d['result'])
+    return status.success
+def _set_org(org: str) -> None:
     global __org
-    my_orgs = list_orgs()
+    my_orgs = list_datasets()
     if org not in my_orgs:
         # Check if org exists
         d = __unauthenticated_rpc_call(method="check_org_exists", params=[Organization(name=org, user=None, connection=None)])
         if 'result' not in d:
-            raise Exception("Failed to check if organization exists")
+            raise Exception("Failed to check if dataset exists")
         status = Status(**d['result'])
         if status.success:
             raise Exception(f"An organization with the name {org} already exists")
-        create = input(f"Would you like to create organization '{org}'? (y/n): ")
+        create = input(f"Would you like to create dataset '{org}'? (y/n): ")
         if create.lower() == 'y':
             db_type = input("What type of database would you like to use? (Snowflake, BigQuery, Postgres, etc.): ")
-            if create_org(org=org, db_type=db_type):
+            __org = 'demo-tpc-h'
+            if create_dataset(dataset=org, db_type=db_type):
                 __org = org
             else:
-                raise Exception("Failed to create organization")
+                __org = None
+                raise Exception("Failed to create dataset")
     else:
         __org = org
-def store_sql(question: str, sql: str) -> bool:
+def use_datasets(datasets: List[str]):
+    """
+    ## Example
+    ```python
+    vn.use_datasets(datasets=["employees", "departments"])
+    ```
+    Set the datasets to use for the Vanna.AI API.
+    Args:
+        datasets (List[str]): A list of dataset names.
+    Returns:
+        bool: True if the datasets were set successfully, False otherwise.
+    """
+    if len(datasets) >= 1:
+        _set_org(org=datasets[0])
+    else:
+        raise Exception("No datasets provided")
+def store_sql(question: str, sql: str, tag: Union[str, None] = "Manually Trained") -> bool:
     """
     ## Example
     ```python
@@ -222,10 +315,12 @@ def store_sql(question: str, sql: str) -> bool:
     Args:
         question (str): The question to store.
         sql (str): The SQL query to store.
+        tag (Union[str, None]): A tag to associate with the question and SQL query.
     """
     params = [QuestionSQLPair(
         question=question,
         sql=sql,
+        tag=tag
     )]
     d = __rpc_call(method="store_sql", params=params)
@@ -237,6 +332,56 @@ def store_sql(question: str, sql: str) -> bool:
     return status.success
+def store_ddl(ddl: str) -> bool:
+    """
+    ## Example
+    ```python
+    vn.store_ddl(
+        ddl="CREATE TABLE employees (id INT, name VARCHAR(255), salary INT)"
+    )
+    ```
+    Store a DDL statement in the Vanna.AI database.
+    Args:
+        ddl (str): The DDL statement to store.
+    """
+    params = [StringData(data=ddl)]
+    d = __rpc_call(method="store_ddl", params=params)
+    if 'result' not in d:
+        return False
+    status = Status(**d['result'])
+    return status.success
+def store_documentation(documentation: str) -> bool:
+    """
+    ## Example
+    ```python
+    vn.store_documentation(
+        documentation="Our organization's definition of sales is the discount price of an item multiplied by the quantity sold."
+    )
+    ```
+    Store a documentation string in the Vanna.AI database.
+    Args:
+        documentation (str): The documentation string to store.
+    """
+    params = [StringData(data=documentation)]
+    d = __rpc_call(method="store_documentation", params=params)
+    if 'result' not in d:
+        return False
+    status = Status(**d['result'])
+    return status.success
 def train(question: str, sql: str) -> bool:
     """
     ## Example
@@ -261,7 +406,7 @@ def flag_sql_for_review(question: str, sql: Union[str, None] = None, error_msg:
     ```python
     vn.flag_sql_for_review(question="What is the average salary of employees?")
     ```
-    Flag a question and its corresponding SQL query for review. You can later retrieve the flagged questions using [`get_flagged_questions()`][vanna.get_flagged_questions].
+    Flag a question and its corresponding SQL query for review. You can see the tag show up in [`vn.get_all_questions()`][vanna.get_all_questions]
     Args:
         question (str): The question to flag.
@@ -287,6 +432,34 @@ def flag_sql_for_review(question: str, sql: Union[str, None] = None, error_msg:
     return status.success
+# def read_questions_from_github(url: str) -> List[QuestionSQLPair]:
+#     """
+#     ## Example
+#     ```python
+#     url = "https://raw.githubusercontent.com/vanna-ai/vanna-ai/main/data/questions.json"
+#     questions = vn.read_questions_from_github(url)
+#     ```
+#     Read questions and SQL queries from a GitHub URL.
+#     Args:
+#         url (str): The URL to read from.
+#     Returns:
+#         List[QuestionSQLPair]: A list of [`QuestionSQLPair`][vanna.QuestionSQLPair] objects.
+#     """
+#     response = requests.get(url)
+#     data = response.json()
+#     question_sql_pairs = []
+#     for item in data:
+#         question = item.get('question')
+#         sql = item.get('sql')
+#         if question and sql:
+#             question_sql_pair = QuestionSQLPair(question=question, sql=sql)
+#             question_sql_pairs.append(question_sql_pair)
+#     return question_sql_pairs
 def remove_sql(question: str) -> bool:
     """
     ## Example
@@ -337,7 +510,65 @@ def generate_sql(question: str) -> str:
     return sql_answer.sql
-def ask(question: str) -> str:
+def generate_followup_questions(question: str, df: pd.DataFrame) -> List[str]:
+    """
+    ## Example
+    ```python
+    vn.generate_followup_questions(question="What is the average salary of employees?", df=df)
+    # ['What is the average salary of employees in the Sales department?', 'What is the average salary of employees in the Engineering department?', ...]
+    ```
+    Generate follow-up questions using the Vanna.AI API.
+    Args:
+        question (str): The question to generate follow-up questions for.
+        df (pd.DataFrame): The DataFrame to generate follow-up questions for.
+    Returns:
+        List[str] or None: The follow-up questions, or None if an error occurred.
+    """
+    params = [DataResult(
+        question=question,
+        sql=None,
+        table_markdown=df.head().to_markdown(),
+        error=None,
+        correction_attempts=0,
+    )]
+    d = __rpc_call(method="generate_followup_questions", params=params)
+    if 'result' not in d:
+        return None
+    # Load the result into a dataclass
+    question_string_list = QuestionStringList(**d['result'])
+    return question_string_list.questions
+def generate_questions() -> List[str]:
+    """
+    ## Example
+    ```python
+    vn.generate_questions()
+    # ['What is the average salary of employees?', 'What is the total salary of employees?', ...]
+    ```
+    Generate questions using the Vanna.AI API.
+    Returns:
+        List[str] or None: The questions, or None if an error occurred.
+    """
+    d = __rpc_call(method="generate_questions", params=[])
+    if 'result' not in d:
+        return None
+    # Load the result into a dataclass
+    question_string_list = QuestionStringList(**d['result'])
+    return question_string_list.questions
+def ask(question: Union[str, None] = None, print_results: bool = True, auto_train: bool = True) -> Tuple[Union[str, None], Union[pd.DataFrame, None], Union[plotly.graph_objs.Figure, None]]:
     """
     ## Example
     ```python
@@ -345,15 +576,61 @@ def ask(question: str) -> str:
     # SELECT AVG(salary) FROM employees
     ```
-    Ask a question using the Vanna.AI API. This is equivalent to calling [`generate_sql()`][vanna.generate_sql].
+    Ask a question using the Vanna.AI API. This generates an SQL query, runs it, and returns the results in a dataframe and a Plotly figure.
     Args:
-        question (str): The question to ask.
+        question (str): The question to ask. If None, you will be prompted to enter a question.
     Returns:
         str or None: The SQL query, or None if an error occurred.
+        pd.DataFrame or None: The results of the SQL query, or None if an error occurred.
+        plotly.graph_objs.Figure or None: The Plotly figure, or None if an error occurred.
     """
-    return generate_sql(question=question)
+    if question is None:
+        question = input("Enter a question: ")
+    try:
+        sql = generate_sql(question=question)
+    except Exception as e:
+        print(e)
+        return None, None, None
+    if print_results:
+        print(sql)
+    if sql_to_df is None:
+        print("If you want to run the SQL query, provide a vn.sql_to_df function.")
+        return sql, None, None
+    try:
+        df = sql_to_df(sql=sql)
+        if print_results:
+            print(df.head().to_markdown())
+        if len(df) > 0 and auto_train:
+            store_sql(question=question, sql=sql, tag="SQL Ran")
+        try:
+            plotly_code = generate_plotly_code(question=question, sql=sql, df=df)
+            fig = get_plotly_figure(plotly_code=plotly_code, df=df)
+            if print_results:
+                fig.show()
+            return sql, df, fig
+        except Exception as e:
+            # Print stack trace
+            traceback.print_exc()
+            print("Couldn't run plotly code: ", e)
+            return sql, df, None
+    except Exception as e:
+        print("Couldn't run sql: ", e)
+        return sql, None, None
 def generate_plotly_code(question: Union[str, None], sql: Union[str, None], df: pd.DataFrame) -> str:
     """
@@ -428,10 +705,7 @@ def get_plotly_figure(plotly_code: str, df: pd.DataFrame, dark_mode: bool = True
 def get_results(cs, default_database: str, sql: str) -> pd.DataFrame:
     """
-    ## Example
-    ```python
-    df = vn.get_results(cs=cs, default_database="PUBLIC", sql="SELECT * FROM students")
-    ```
+    DEPRECATED. Use `vn.sql_to_df` instead.
     Run the SQL query and return the results as a pandas dataframe. This is just a helper function that does not use the Vanna.AI API.
     Args:
@@ -442,6 +716,9 @@ def get_results(cs, default_database: str, sql: str) -> pd.DataFrame:
     Returns:
         pd.DataFrame: The results of the SQL query.
     """
+    print("`vn.get_results()` is deprecated. Use `vn.sql_to_df()` instead.")
+    warnings.warn("`vn.get_results()` is deprecated. Use `vn.sql_to_df()` instead.")
     cs.execute(f"USE DATABASE {default_database}")
     cur = cs.execute(sql)
@@ -524,55 +801,31 @@ def generate_question(sql: str) -> str:
     return question.question
-def get_flagged_questions() -> QuestionList:
+def get_all_questions() -> pd.DataFrame:
     """
     ## Example
     ```python
-    questions = vn.get_flagged_questions()
+    questions = vn.get_all_questions()
     ```
-    Get a list of flagged questions from the Vanna.AI API.
+    Get a list of questions from the Vanna.AI API.
     Returns:
-        List[FullQuestionDocument] or None: The list of flagged questions, or None if an error occurred.
+        pd.DataFrame or None: The list of questions, or None if an error occurred.
     """
     # params = [Question(question="")]
     params = []
-    d = __rpc_call(method="get_flagged_questions", params=params)
+    d = __rpc_call(method="get_all_questions", params=params)
     if 'result' not in d:
         return None
     # Load the result into a dataclass
-    flagged_questions = QuestionList(**d['result'])
-    return flagged_questions
-def get_accuracy_stats() -> AccuracyStats:
-    """
-    ## Example
-    ```python
-    vn.get_accuracy_stats()
-    ```
-    Get the accuracy statistics from the Vanna.AI API.
-    Returns:
-        dict or None: The accuracy statistics, or None if an error occurred.
-    """
-    params = []
-    d = __rpc_call(method="get_accuracy_stats", params=params)
+    all_questions = DataFrameJSON(**d['result'])
-    if 'result' not in d:
-        return None
+    df = pd.read_json(all_questions.data)
-    # Load the result into a dataclass
-    accuracy_stats = AccuracyStats(**d['result'])
-    return accuracy_stats
+    return df

{vanna-0.0.6 → vanna-0.0.11}/src/vanna/types.py RENAMED Viewed

@@ -1,5 +1,5 @@
 from __future__ import annotations
-from typing import List, Dict
+from typing import List, Dict, Union
 from dataclasses import dataclass
 @dataclass
@@ -23,6 +23,7 @@ class FullQuestionDocument:
 class QuestionSQLPair:
     question: str
     sql: str
+    tag: Union[str, None]
 @dataclass
 class Organization:
@@ -34,6 +35,14 @@ class Organization:
 class OrganizationList:
     organizations: List[str]
+@dataclass
+class QuestionStringList:
+    questions: List[str]
+@dataclass
+class Visibility:
+    visibility: bool
 @dataclass
 class UserEmail:
     email: str
@@ -43,6 +52,12 @@ class NewOrganization:
     org_name: str
     db_type: str
+@dataclass
+class NewOrganizationMember:
+    org_name: str
+    email: str
+    is_admin: bool
 @dataclass
 class UserOTP:
     email: str
@@ -68,7 +83,7 @@ class QuestionCategory:
     NO_SQL_GENERATED = "No SQL Generated"
     SQL_UNABLE_TO_RUN = "SQL Unable to Run"
     BOOTSTRAP_TRAINING_QUERY = "Bootstrap Training Query"
-    ASSUMED_CORRECT = "Assumed Correct"
+    SQL_RAN = "SQL Ran Successfully"
     FLAGGED_FOR_REVIEW = "Flagged for Review"
     REVIEWED_AND_APPROVED = "Reviewed and Approved"
     REVIEWED_AND_REJECTED = "Reviewed and Rejected"
@@ -140,4 +155,12 @@ class ColumnDefinition:
 @dataclass
 class Diagram:
     raw: str
-    mermaid_code: str
+    mermaid_code: str
+@dataclass
+class StringData:
+    data: str
+@dataclass
+class DataFrameJSON:
+    data: str

vanna-0.0.11/src/vanna.egg-info/PKG-INFO ADDED Viewed

@@ -0,0 +1,156 @@
+Metadata-Version: 2.1
+Name: vanna
+Version: 0.0.11
+Summary: Generate SQL queries from natural language
+Author-email: Zain Hoda <zain@vanna.ai>
+Project-URL: Homepage, https://github.com/vanna-ai/vanna-py
+Project-URL: Bug Tracker, https://github.com/vanna-ai/vanna-py/issues
+Classifier: Programming Language :: Python :: 3
+Classifier: License :: OSI Approved :: MIT License
+Classifier: Operating System :: OS Independent
+Requires-Python: >=3.7
+Description-Content-Type: text/markdown
+License-File: LICENSE
+| GitHub | PyPI | Colab | Documentation |
+| ------ | ---- | ----- | ------------- |
+| [![GitHub](https://img.shields.io/badge/GitHub-vanna--py-blue?logo=github)](https://github.com/vanna-ai/vanna-py) | [![PyPI](https://img.shields.io/pypi/v/vanna?logo=pypi)](https://pypi.org/project/vanna/) | [![Colab](https://img.shields.io/badge/Colab-vanna--py-blue?logo=google-colab)](https://colab.research.google.com/github/vanna-ai/vanna-py/blob/main/notebooks/vn-starter.ipynb) | [![Documentation](https://img.shields.io/badge/Documentation-vanna--py-blue?logo=read-the-docs)](https://docs.vanna.ai) |
+# Vanna.AI
+Vanna is a Python-based AI SQL co-pilot. Our initial users are data-savvy data analysts, data scientists, engineers, and similar people that use Vanna to automate writing complex SQL.
+Vanna can:
+- [Convert natural language to SQL](#natural-language-to-sql)
+- [Run SQL](#run-sql)
+- [Generate Plotly code](#generate-plotly-code)
+- [Run Plotly code](#run-plotly-code)
+- [Get better over time](#improve-your-training-data)
+- Be used in a Jupyter Notebooks, Colab, or other Python environments
+- Be used with Snowflake, BigQuery, and other databases
+- Be used with Python UIs, such as [Streamlit](https://github.com/vanna-ai/vanna-streamlit), Dash, and others
+- Be used to make Slack bots
+## Natural Language to SQL
+```python
+sql = vn.generate_sql(question='Who are the top 10 customers?')
+```
+### Output:
+```sql
+SELECT customer_name,
+       total_sales
+FROM   (SELECT c.c_name as customer_name,
+               sum(l.l_extendedprice * (1 - l.l_discount)) as total_sales,
+               row_number() OVER (ORDER BY sum(l.l_extendedprice * (1 - l.l_discount)) desc) as rank
+        FROM   snowflake_sample_data.tpch_sf1.lineitem l join snowflake_sample_data.tpch_sf1.orders o
+                ON l.l_orderkey = o.o_orderkey join snowflake_sample_data.tpch_sf1.customer c
+                ON o.o_custkey = c.c_custkey
+        GROUP BY customer_name)
+WHERE  rank <= 10;
+```
+## Run SQL
+This function is provided as a convenience. You can choose to run your SQL however you normally do and use the rest of the downstream functions.
+```python
+df = vn.get_results(cs, database, sql)
+```
+### Output:
+| customer_name | total_sales |
+| ------------- | ----------- |
+| Customer#000000001 |  68127.72 |
+| Customer#000000002 |  65898.69 |
+...
+## Generate Plotly Code
+```python
+plotly_code = vn.generate_plotly_code(question=my_question, sql=sql, df=df)
+```
+### Output:
+```python
+fig = go.Figure(go.Bar(
+    x=df['CUSTOMER_NAME'],
+    y=df['TOTAL_SALES'],
+    marker={'color': df['TOTAL_SALES'], 'colorscale': 'Viridis'},
+    text=df['TOTAL_SALES'],
+    textposition='auto',
+))
+fig.update_layout(
+    title="Top 10 Customers by Sales",
+    xaxis_title="Customer",
+    yaxis_title="Total Sales",
+    xaxis_tickangle=-45,
+    yaxis_tickprefix="$",
+)
+```
+## Run Plotly Code
+```python
+fig = vn.get_plotly_figure(plotly_code=plotly_code, df=df)
+fig.show()
+```
+### Output:
+![Top 10 Customers by Sales](docs/chart.png)
+## Improve Your Training Data
+```python
+vn.store_sql(
+    question=my_question,
+    sql=sql,
+)
+```
+## How Vanna Works
+```mermaid
+flowchart LR
+    DB[(Known Correct Question-SQL)]
+    Try[Try to Use DDL/Documentation]
+    SQL(SQL)
+    Check{Is the SQL correct?}
+    Generate[fa:fa-circle-question Use Examples to Generate]
+    DB --> Find
+    Question[fa:fa-circle-question Question] --> Find{fa:fa-magnifying-glass Do we have similar questions?}
+    Find -- Yes --> Generate
+    Find -- No --> Try
+    Generate --> SQL
+    Try --> SQL
+    SQL --> Check
+    Check -- Yes --> DB
+    Check -- No --> Analyst[fa:fa-glasses Analyst Writes the SQL]
+    Analyst -- Adds --> DB
+```
+# Getting Started
+## Install Vanna from PyPI and import it:
+```python
+%pip install vanna
+import vanna as vn
+```
+## Enter your email to set an API Key
+This will send a one-time code to your email address. Copy and paste the code into the prompt.
+```python
+my_email = '' # Enter your email here
+vn.login(email=my_email)
+```
+## Add Training Data
+```python
+vn.train(
+    question="Which products have the highest sales?",
+    sql="...",
+)
+```
+## Generate SQL
+```python
+sql = vn.generate_sql(question="Who are the top 10 customers?")
+```
+# Documentation
+[Full Documentation](https://docs.vanna.ai)

{vanna-0.0.6 → vanna-0.0.11}/src/vanna.egg-info/requires.txt RENAMED Viewed

@@ -1,3 +1,4 @@
 requests
 tabulate
 plotly
+pandas

vanna-0.0.6/PKG-INFO DELETED Viewed

@@ -1,20 +0,0 @@
-Metadata-Version: 2.1
-Name: vanna
-Version: 0.0.6
-Summary: Generate SQL queries from natural language
-Author-email: Zain Hoda <zain@vanna.ai>
-Project-URL: Homepage, https://github.com/vanna-ai/vanna-py
-Project-URL: Bug Tracker, https://github.com/vanna-ai/vanna-py/issues
-Classifier: Programming Language :: Python :: 3
-Classifier: License :: OSI Approved :: MIT License
-Classifier: Operating System :: OS Independent
-Requires-Python: >=3.7
-Description-Content-Type: text/markdown
-License-File: LICENSE
-# Vanna.AI
-Vanna.AI is a tool to help you generate SQL from natural language.
-# Documentation
-[Full Documentation Reference](https://docs.vanna.ai)

vanna-0.0.6/README.md DELETED Viewed

@@ -1,6 +0,0 @@
-# Vanna.AI
-Vanna.AI is a tool to help you generate SQL from natural language.
-# Documentation
-[Full Documentation Reference](https://docs.vanna.ai)

vanna-0.0.6/src/vanna.egg-info/PKG-INFO DELETED Viewed

@@ -1,20 +0,0 @@
-Metadata-Version: 2.1
-Name: vanna
-Version: 0.0.6
-Summary: Generate SQL queries from natural language
-Author-email: Zain Hoda <zain@vanna.ai>
-Project-URL: Homepage, https://github.com/vanna-ai/vanna-py
-Project-URL: Bug Tracker, https://github.com/vanna-ai/vanna-py/issues
-Classifier: Programming Language :: Python :: 3
-Classifier: License :: OSI Approved :: MIT License
-Classifier: Operating System :: OS Independent
-Requires-Python: >=3.7
-Description-Content-Type: text/markdown
-License-File: LICENSE
-# Vanna.AI
-Vanna.AI is a tool to help you generate SQL from natural language.
-# Documentation
-[Full Documentation Reference](https://docs.vanna.ai)

{vanna-0.0.6 → vanna-0.0.11}/LICENSE RENAMED Viewed

File without changes

{vanna-0.0.6 → vanna-0.0.11}/setup.cfg RENAMED Viewed

File without changes

{vanna-0.0.6 → vanna-0.0.11}/src/vanna.egg-info/SOURCES.txt RENAMED Viewed

File without changes

{vanna-0.0.6 → vanna-0.0.11}/src/vanna.egg-info/dependency_links.txt RENAMED Viewed

File without changes

{vanna-0.0.6 → vanna-0.0.11}/src/vanna.egg-info/top_level.txt RENAMED Viewed

File without changes

vanna 0.0.6__tar.gz → 0.0.11__tar.gz

vanna 0.0.6tar.gz → 0.0.11tar.gz