Skip to main content
Bag of words connects to virtually any data source, bringing your data directly into AI-powered analysis. Whether your data lives in a traditional database, a modern cloud warehouse, or specialized business services, you can connect it in minutes and start asking questions immediately. Connect from anywhere:
  • Databases & Warehouses: PostgreSQL, MySQL, Snowflake, BigQuery, Redshift, and more - connect directly to your data infrastructure
  • Business Services: Tableau, Salesforce, NetSuite, AWS Cost Explorer, Google Analytics - pull data from the tools you use every day
No data movement required. Bag of words queries your data where it lives, respecting your existing security and access controls. The AI agent understands your schema, relationships, and business context to generate accurate queries and insights.

Data Source

In addition to basic connection settings, the data source entity will hold which tables are accessible to AI agents, context enrichers (tableau, dbt, etc), conversation starters, access control, and more

Tables

Manage which tables are available to the AI agent by enabling or disabling tables to control visibility, searching for specific tables quickly, reloading the list if your schema changes, and viewing tables that exist but aren’t currently enabled. Select Tables
Start with your most important tables. You can always enable more later as needed.

Enrich your data souce with Tableau, dbt, AGENTS.md

Link additional metadata and documentation to enrich the AI’s understanding: Metadata/Resources: Connect external resources that provide semantic context:
  • dbt: Link your dbt project to pull in model descriptions, column documentation, and relationships
  • Tableau: Import data models and calculated fields from Tableau workbooks
  • LookML: Connect Looker models to leverage existing business logic
  • Markdown files (AGENTS.md): Include custom documentation files from your git repository
  • Coding files: python, sql
These integrations allow the AI to use your existing data documentation automatically. Connect Metadata Resources Integration is via git repository - point Bag of words to your repo and it syncs automatically.

Access

Under the settings tab, you can control who can use this data source and how they authenticate.
  • Members: Add or remove team members who can access this data source. Each member can have different permission levels.
  • User Authentication: Set user auth required, to require each user to enter their own database credentials rather than using shared system credentials.

Description & Conversation Starters

Description: Explain what this data source contains - the business domain, key entities, and how data is structured. The richer your description, the better the AI understands your data. Example:
This data source is a database for a movie rental business. It tracks movies (films), 
actors, customers, rentals, payments, and stores. Key tables include:

• film: details about each movie
• actor: information about actors
• customer: customer contact and account info
• rental: records of movie rentals by customers
• payment: payments made by customers
• store: store locations and staff

Movies are linked to actors (film_actor), categories (film_category), and inventory 
(inventory). Customers are linked to rentals and payments. Staff manage stores and 
transactions.
Data Source Overview Conversation Starters: Pre-defined prompts that appear as chips in home page, helping users quickly explore common analyses. Example starters:
  • “Top Renting Actors”
  • “Most Popular Film Categories”
  • “Inactive Customers”
  • “Store Performance Comparison”
Click Edit to modify the title or the full prompt starters at any time.

Supported Data Sources

PostgreSQLDatabase/Warehouse
SnowflakeDatabase/Warehouse
Google BigQueryDatabase/Warehouse
NetSuiteService
MySQLDatabase/Warehouse
AWS AthenaDatabase/Warehouse
MariaDBDatabase/Warehouse
SalesforceService
Microsoft SQL ServerDatabase/Warehouse
ClickHouseDatabase/Warehouse
AWS Cost ExplorerService
VerticaDatabase/Warehouse
AWS RedshiftDatabase/Warehouse
TableauService
DuckDBDatabase/Warehouse
Apache PinotDatabase/Warehouse
Oracle DBDatabase/Warehouse
Your data source is missing? Create a PR or open an issue in the github repo: https://github.com/bagofwords1/bagofwords

Best Practices

  • Provide rich descriptions: The more context you give about your data, the better the AI performs
  • Create helpful conversation starters: Guide users toward common analyses
  • Enable relevant tables only: Don’t overwhelm the AI with unused tables
  • Use descriptive names: Name your connections clearly (e.g., “Production DB” vs “Dev DB”)
  • Add instructions: Create data source-specific instructions for calculation logic or business rules
  • Manage access carefully: Use member permissions and user authentication for sensitive data