Skip to main content
NEW · APP STORE Now on iOS · macOS · iPad Android & Windows soon GET IT
Prompts Big Data Cloud Automation via Rube MCP

agent tool_use skill risk: high

Big Data Cloud Automation via Rube MCP

Instructs on automating Big Data Cloud operations through Composio's toolkit via Rube MCP, requiring RUBE_SEARCH_TOOLS calls first to obtain current schemas before using RUBE_MANAG…

  • Policy sensitive
  • Human review
  • External action: high

SKILL 1 file

SKILL.md
---
name: big-data-cloud-automation
description: "Automate Big Data Cloud tasks via Rube MCP (Composio). Always search tools first for current schemas."
---
# Big Data Cloud Automation via Rube MCP

Automate Big Data Cloud operations through Composio's Big Data Cloud toolkit via Rube MCP.

**Toolkit docs**: [composio.dev/toolkits/big_data_cloud](https://composio.dev/toolkits/big_data_cloud)

## Prerequisites

- Rube MCP must be connected (RUBE_SEARCH_TOOLS available)
- Active Big Data Cloud connection via `RUBE_MANAGE_CONNECTIONS` with toolkit `big_data_cloud`
- Always call `RUBE_SEARCH_TOOLS` first to get current tool schemas

## Setup

**Get Rube MCP**: Add `https://rube.app/mcp` as an MCP server in your client configuration. No API keys needed — just add the endpoint and it works.

1. Verify Rube MCP is available by confirming `RUBE_SEARCH_TOOLS` responds
2. Call `RUBE_MANAGE_CONNECTIONS` with toolkit `big_data_cloud`
3. If connection is not ACTIVE, follow the returned auth link to complete setup
4. Confirm connection status shows ACTIVE before running any workflows

## Tool Discovery

Always discover available tools before executing workflows:

```
RUBE_SEARCH_TOOLS
queries: [{use_case: "Big Data Cloud operations", known_fields: ""}]
session: {generate_id: true}
```

This returns available tool slugs, input schemas, recommended execution plans, and known pitfalls.

## Core Workflow Pattern

### Step 1: Discover Available Tools

```
RUBE_SEARCH_TOOLS
queries: [{use_case: "your specific Big Data Cloud task"}]
session: {id: "existing_session_id"}
```

### Step 2: Check Connection

```
RUBE_MANAGE_CONNECTIONS
toolkits: ["big_data_cloud"]
session_id: "your_session_id"
```

### Step 3: Execute Tools

```
RUBE_MULTI_EXECUTE_TOOL
tools: [{
  tool_slug: "TOOL_SLUG_FROM_SEARCH",
  arguments: {/* schema-compliant args from search results */}
}]
memory: {}
session_id: "your_session_id"
```

## Known Pitfalls

- **Always search first**: Tool schemas change. Never hardcode tool slugs or arguments without calling `RUBE_SEARCH_TOOLS`
- **Check connection**: Verify `RUBE_MANAGE_CONNECTIONS` shows ACTIVE status before executing tools
- **Schema compliance**: Use exact field names and types from the search results
- **Memory parameter**: Always include `memory` in `RUBE_MULTI_EXECUTE_TOOL` calls, even if empty (`{}`)
- **Session reuse**: Reuse session IDs within a workflow. Generate new ones for new workflows
- **Pagination**: Check responses for pagination tokens and continue fetching until complete

## Quick Reference

| Operation | Approach |
|-----------|----------|
| Find tools | `RUBE_SEARCH_TOOLS` with Big Data Cloud-specific use case |
| Connect | `RUBE_MANAGE_CONNECTIONS` with toolkit `big_data_cloud` |
| Execute | `RUBE_MULTI_EXECUTE_TOOL` with discovered tool slugs |
| Bulk ops | `RUBE_REMOTE_WORKBENCH` with `run_composio_tool()` |
| Full schema | `RUBE_GET_TOOL_SCHEMAS` for tools with `schemaRef` |

---
*Powered by [Composio](https://composio.dev)*

INPUTS

use_case REQUIRED

specific Big Data Cloud task description for tool search

e.g. Big Data Cloud operations

session_id

session identifier for reusing context

e.g. existing_session_id

tool_slug REQUIRED

discovered tool identifier from search results

e.g. TOOL_SLUG_FROM_SEARCH

REQUIRED CONTEXT

  • RUBE_SEARCH_TOOLS availability
  • active big_data_cloud connection via RUBE_MANAGE_CONNECTIONS

OPTIONAL CONTEXT

  • existing session_id

TOOLS REQUIRED

  • RUBE_SEARCH_TOOLS
  • RUBE_MANAGE_CONNECTIONS
  • RUBE_MULTI_EXECUTE_TOOL
  • RUBE_REMOTE_WORKBENCH
  • RUBE_GET_TOOL_SCHEMAS

ROLES & RULES

  1. Always search tools first for current schemas
  2. Always call RUBE_SEARCH_TOOLS first to get current tool schemas
  3. Always discover available tools before executing workflows
  4. Verify Rube MCP is available by confirming RUBE_SEARCH_TOOLS responds
  5. Call RUBE_MANAGE_CONNECTIONS with toolkit big_data_cloud
  6. Confirm connection status shows ACTIVE before running any workflows
  7. Never hardcode tool slugs or arguments without calling RUBE_SEARCH_TOOLS
  8. Verify RUBE_MANAGE_CONNECTIONS shows ACTIVE status before executing tools
  9. Use exact field names and types from the search results
  10. Always include memory in RUBE_MULTI_EXECUTE_TOOL calls, even if empty
  11. Reuse session IDs within a workflow
  12. Generate new session IDs for new workflows
  13. Check responses for pagination tokens and continue fetching until complete

EXPECTED OUTPUT

Format
markdown
Constraints
  • include prerequisites and setup steps
  • always start with RUBE_SEARCH_TOOLS
  • use exact function call formats shown
  • list known pitfalls

SUCCESS CRITERIA

  • Rube MCP must be connected
  • Active Big Data Cloud connection via RUBE_MANAGE_CONNECTIONS
  • Always call RUBE_SEARCH_TOOLS first
  • Confirm connection status shows ACTIVE

FAILURE MODES

  • Tool schemas change so never hardcode without searching first
  • May attempt execution without active connection
  • May use incorrect field names or types
  • May omit memory parameter in execute calls
  • May fail to reuse or correctly manage session IDs
  • May stop before handling all pagination tokens

CAVEATS

Dependencies
  • Rube MCP must be connected (RUBE_SEARCH_TOOLS available)
  • Active Big Data Cloud connection via RUBE_MANAGE_CONNECTIONS
Missing context
  • Target model or agent framework expected to consume the prompt
  • Preferred output format when reporting workflow results
Ambiguities
  • Placeholder text such as "your specific Big Data Cloud task" and "your_session_id" left for user substitution without explicit markup.

QUALITY

OVERALL
0.81
CLARITY
0.78
SPECIFICITY
0.85
REUSABILITY
0.82
COMPLETENESS
0.80

IMPROVEMENT SUGGESTIONS

  • Wrap all user-supplied values in clearly delimited placeholders (e.g., {{use_case}}, {{session_id}}) and add a short "Variables" section at the top.
  • Add a one-sentence success criterion at the end of the Core Workflow Pattern section.

USAGE

Copy the prompt above and paste it into your AI of choice — Claude, ChatGPT, Gemini, or anywhere else you're working. Replace any placeholder sections with your own context, then ask for the output.

MORE FOR AGENT