agent tool_use skill risk: high

Big Data Cloud Automation via Rube MCP

Instructs on automating Big Data Cloud operations through Composio's toolkit via Rube MCP, requiring RUBE_SEARCH_TOOLS calls first to obtain current schemas before using RUBE_MANAG…

Policy sensitive
Human review
External action: high

SKILL 1 file

SKILL.md

Download

---
name: big-data-cloud-automation
description: "Automate Big Data Cloud tasks via Rube MCP (Composio). Always search tools first for current schemas."
---
# Big Data Cloud Automation via Rube MCP

Automate Big Data Cloud operations through Composio's Big Data Cloud toolkit via Rube MCP.

**Toolkit docs**: [composio.dev/toolkits/big_data_cloud](https://composio.dev/toolkits/big_data_cloud)

## Prerequisites

- Rube MCP must be connected (RUBE_SEARCH_TOOLS available)
- Active Big Data Cloud connection via `RUBE_MANAGE_CONNECTIONS` with toolkit `big_data_cloud`
- Always call `RUBE_SEARCH_TOOLS` first to get current tool schemas

## Setup

**Get Rube MCP**: Add `https://rube.app/mcp` as an MCP server in your client configuration. No API keys needed — just add the endpoint and it works.

1. Verify Rube MCP is available by confirming `RUBE_SEARCH_TOOLS` responds
2. Call `RUBE_MANAGE_CONNECTIONS` with toolkit `big_data_cloud`
3. If connection is not ACTIVE, follow the returned auth link to complete setup
4. Confirm connection status shows ACTIVE before running any workflows

## Tool Discovery

Always discover available tools before executing workflows:

```
RUBE_SEARCH_TOOLS
queries: [{use_case: "Big Data Cloud operations", known_fields: ""}]
session: {generate_id: true}
```

This returns available tool slugs, input schemas, recommended execution plans, and known pitfalls.

## Core Workflow Pattern

### Step 1: Discover Available Tools

```
RUBE_SEARCH_TOOLS
queries: [{use_case: "your specific Big Data Cloud task"}]
session: {id: "existing_session_id"}
```

### Step 2: Check Connection

```
RUBE_MANAGE_CONNECTIONS
toolkits: ["big_data_cloud"]
session_id: "your_session_id"
```

### Step 3: Execute Tools

```
RUBE_MULTI_EXECUTE_TOOL
tools: [{
  tool_slug: "TOOL_SLUG_FROM_SEARCH",
  arguments: {/* schema-compliant args from search results */}
}]
memory: {}
session_id: "your_session_id"
```

## Known Pitfalls

- **Always search first**: Tool schemas change. Never hardcode tool slugs or arguments without calling `RUBE_SEARCH_TOOLS`
- **Check connection**: Verify `RUBE_MANAGE_CONNECTIONS` shows ACTIVE status before executing tools
- **Schema compliance**: Use exact field names and types from the search results
- **Memory parameter**: Always include `memory` in `RUBE_MULTI_EXECUTE_TOOL` calls, even if empty (`{}`)
- **Session reuse**: Reuse session IDs within a workflow. Generate new ones for new workflows
- **Pagination**: Check responses for pagination tokens and continue fetching until complete

## Quick Reference

| Operation | Approach |
|-----------|----------|
| Find tools | `RUBE_SEARCH_TOOLS` with Big Data Cloud-specific use case |
| Connect | `RUBE_MANAGE_CONNECTIONS` with toolkit `big_data_cloud` |
| Execute | `RUBE_MULTI_EXECUTE_TOOL` with discovered tool slugs |
| Bulk ops | `RUBE_REMOTE_WORKBENCH` with `run_composio_tool()` |
| Full schema | `RUBE_GET_TOOL_SCHEMAS` for tools with `schemaRef` |

---
*Powered by [Composio](https://composio.dev)*

INPUTS

use_case REQUIRED

specific Big Data Cloud task description for tool search

e.g. Big Data Cloud operations

session_id

session identifier for reusing context

e.g. existing_session_id

tool_slug REQUIRED

discovered tool identifier from search results

e.g. TOOL_SLUG_FROM_SEARCH

REQUIRED CONTEXT

RUBE_SEARCH_TOOLS availability
active big_data_cloud connection via RUBE_MANAGE_CONNECTIONS

OPTIONAL CONTEXT

existing session_id

TOOLS REQUIRED

RUBE_SEARCH_TOOLS
RUBE_MANAGE_CONNECTIONS
RUBE_MULTI_EXECUTE_TOOL
RUBE_REMOTE_WORKBENCH
RUBE_GET_TOOL_SCHEMAS

ROLES & RULES

Always search tools first for current schemas
Always call RUBE_SEARCH_TOOLS first to get current tool schemas
Always discover available tools before executing workflows
Verify Rube MCP is available by confirming RUBE_SEARCH_TOOLS responds
Call RUBE_MANAGE_CONNECTIONS with toolkit big_data_cloud
Confirm connection status shows ACTIVE before running any workflows
Never hardcode tool slugs or arguments without calling RUBE_SEARCH_TOOLS
Verify RUBE_MANAGE_CONNECTIONS shows ACTIVE status before executing tools
Use exact field names and types from the search results
Always include memory in RUBE_MULTI_EXECUTE_TOOL calls, even if empty
Reuse session IDs within a workflow
Generate new session IDs for new workflows
Check responses for pagination tokens and continue fetching until complete

EXPECTED OUTPUT

Format

markdown

Constraints

include prerequisites and setup steps
always start with RUBE_SEARCH_TOOLS
use exact function call formats shown
list known pitfalls

SUCCESS CRITERIA

Rube MCP must be connected
Active Big Data Cloud connection via RUBE_MANAGE_CONNECTIONS
Always call RUBE_SEARCH_TOOLS first
Confirm connection status shows ACTIVE

FAILURE MODES

Tool schemas change so never hardcode without searching first
May attempt execution without active connection
May use incorrect field names or types
May omit memory parameter in execute calls
May fail to reuse or correctly manage session IDs
May stop before handling all pagination tokens

CAVEATS

Dependencies

Rube MCP must be connected (RUBE_SEARCH_TOOLS available)
Active Big Data Cloud connection via RUBE_MANAGE_CONNECTIONS

Missing context

Target model or agent framework expected to consume the prompt
Preferred output format when reporting workflow results

Ambiguities

Placeholder text such as "your specific Big Data Cloud task" and "your_session_id" left for user substitution without explicit markup.

QUALITY

OVERALL: 0.81
CLARITY: 0.78
SPECIFICITY: 0.85
REUSABILITY: 0.82
COMPLETENESS: 0.80

IMPROVEMENT SUGGESTIONS

Wrap all user-supplied values in clearly delimited placeholders (e.g., {{use_case}}, {{session_id}}) and add a short "Variables" section at the top.
Add a one-sentence success criterion at the end of the Core Workflow Pattern section.

USAGE

Copy the prompt above and paste it into your AI of choice — Claude, ChatGPT, Gemini, or anywhere else you're working. Replace any placeholder sections with your own context, then ask for the output.