Features Overview
Skill Seekers v3.5.0 is a comprehensive toolkit for transforming any knowledge source into structured AI skills and RAG-ready knowledge.
Input Sources (17 Types)
Documentation Scraping
- HTML websites — Scrape any documentation site (React, Vue, Django, Godot, etc.)
- Smart SPA discovery (v3.5.0) — Three-layer engine: sitemap.xml, llms.txt, and browser rendering for JavaScript SPA sites
- Browser rendering (v3.5.0) — Playwright-based rendering for React, Vue, Angular sites that return empty HTML shells
- Unlimited pages — No page limits, handles 40K+ page documentation sites
- Smart selectors — Automatic content detection or custom CSS selectors
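
The layered discovery described above can be pictured as a fallback chain. The following is a minimal sketch, not the project's actual implementation: the function names and the `"browser"` fallback label are hypothetical, and it uses the standard-library XML parser only to stay self-contained (for untrusted sitemaps, a hardened parser such as defusedxml is the safer choice).

```python
from __future__ import annotations

import re
import xml.etree.ElementTree as ET

# Namespace used by the sitemaps.org 0.9 protocol.
SITEMAP_NS = "{http://www.sitemaps.org/schemas/sitemap/0.9}"
# llms.txt files are Markdown; pages are listed as [title](url) links.
MD_LINK = re.compile(r"\[[^\]]*\]\((https?://[^)\s]+)\)")


def urls_from_sitemap(sitemap_xml: str) -> list[str]:
    """Return every <loc> URL in a sitemap, or [] if the XML is unparsable."""
    try:
        root = ET.fromstring(sitemap_xml)
    except ET.ParseError:
        return []
    return [loc.text.strip() for loc in root.iter(f"{SITEMAP_NS}loc") if loc.text]


def urls_from_llms_txt(llms_txt: str) -> list[str]:
    """Return every http(s) link target found in an llms.txt document."""
    return MD_LINK.findall(llms_txt)


def discover(sitemap_xml: str | None, llms_txt: str | None) -> tuple[str, list[str]]:
    """Try sitemap first, then llms.txt; otherwise signal that rendering is needed."""
    if sitemap_xml:
        urls = urls_from_sitemap(sitemap_xml)
        if urls:
            return "sitemap", urls
    if llms_txt:
        urls = urls_from_llms_txt(llms_txt)
        if urls:
            return "llms.txt", urls
    # Hypothetical label: the caller would fall back to browser rendering here.
    return "browser", []
```

The cheap, static sources are tried first so that a headless browser is only needed when neither file yields any URLs.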
GitHub Repository Analysis
- Unlimited local analysis — Analyze entire codebases without API rate limits (50-file limit removed in v3.5.0)
- Three-stream fetcher — Code + Docs + Issues streams
- C3.x deep analysis — AST parsing across 27+ languages including Kotlin (v3.5.0)
- 10 GoF pattern detectors — Singleton, Factory, Strategy, Observer, etc.
- Test example extraction — Real usage examples from test files
PDF & Document Extraction
- PDF — Text extraction with OCR fallback for scanned documents
- Word (.docx) (v3.2.0) — Full document extraction
- EPUB (v3.3.0) — Chapter extraction, DRM detection, EPUB 2/3 support
- PowerPoint (.pptx) (v3.3.0) — Slide text, speaker notes, tables, code block detection
- AsciiDoc (v3.3.0) — Headings, code blocks, tables, admonitions
Media & API Sources
- Video (v3.2.0) — YouTube and local video extraction: transcripts, visual OCR, code timeline tracking
- Jupyter Notebooks (v3.3.0) — Code cells, outputs, markdown, kernel metadata
- OpenAPI/Swagger (v3.3.0) — Endpoint extraction, schema resolution, security schemes
- RSS/Atom feeds (v3.3.0) — Article extraction with optional full-page scraping
- Local HTML (v3.3.0) — Smart main content detection
- Man pages (v3.3.0) — Structured section parsing, gzip/bzip2/xz support
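
As an example of what extraction from one of these sources involves, a Jupyter notebook is a JSON document whose cells can be read with the standard library. This is a hedged sketch assuming nbformat v4 layout; `summarize_notebook` is a hypothetical name, not a Skill Seekers API.

```python
import json


def summarize_notebook(notebook_json: str) -> dict:
    """Collect code cells, markdown cells, and the kernel name from a notebook."""
    nb = json.loads(notebook_json)
    summary = {
        # Kernel metadata lives under metadata.kernelspec in nbformat v4.
        "kernel": nb.get("metadata", {}).get("kernelspec", {}).get("name"),
        "code": [],
        "markdown": [],
    }
    for cell in nb.get("cells", []):
        kind = cell.get("cell_type")
        if kind not in ("code", "markdown"):
            continue  # skip raw cells and anything unrecognized
        source = cell.get("source", "")
        # nbformat allows `source` to be a single string or a list of lines.
        text = "".join(source) if isinstance(source, list) else source
        summary[kind].append(text)
    return summary
```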
Collaboration & Wiki Sources
- Confluence (v3.3.0) — REST API or HTML export, page hierarchy, macros
- Notion (v3.3.0) — API or Markdown export, 20+ block types, 16+ property types
- Slack/Discord (v3.3.0) — Workspace exports or live API, threads, code snippets
Output Platforms (12+)
| Category | Platforms |
|---|---|
| AI Skills | Claude, Gemini, OpenAI, Kimi, DeepSeek, Qwen, OpenRouter, Together, Fireworks, MiniMax, OpenCode |
| RAG/Vectors | LangChain, LlamaIndex, Pinecone, Chroma, FAISS, Haystack, Qdrant, Weaviate |
| AI Coding | Cursor, Windsurf, Cline, Continue.dev, Roo, Aider, Bolt, Kilo |
| Generic | Markdown, JSON, YAML |
Agent-Agnostic Enhancement (v3.5.0)
All 5 enhancers now support multiple AI agents via the unified AgentClient abstraction:
- API mode: Anthropic, Moonshot/Kimi, Google Gemini, OpenAI
- LOCAL mode: Claude Code, Kimi, Codex, Copilot, OpenCode, or any custom agent
- Auto-detection: Selects an available agent based on which API keys are set
- Custom agents: `--agent-cmd "my-agent run"` for any CLI-based agent

Examples:

    skill-seekers create https://react.dev --agent kimi
    skill-seekers create https://react.dev --agent codex
    skill-seekers create https://react.dev --agent-cmd "my-custom-agent run"
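
Auto-detection from API keys could work roughly as follows. This is only a sketch: the environment variable names, the agent labels, and the priority order are all assumptions chosen for illustration, and the real AgentClient may check different variables.

```python
import os
from typing import Mapping, Optional

# Hypothetical mapping from environment variable to agent label, in priority order.
KEY_TO_AGENT = (
    ("ANTHROPIC_API_KEY", "claude"),
    ("MOONSHOT_API_KEY", "kimi"),
    ("GOOGLE_API_KEY", "gemini"),
    ("OPENAI_API_KEY", "openai"),
)


def detect_agent(env: Mapping[str, str] = os.environ) -> Optional[str]:
    """Return the first agent whose API key is set and non-blank, else None."""
    for var, agent in KEY_TO_AGENT:
        if env.get(var, "").strip():
            return agent
    return None
```

Passing the environment as a parameter keeps the function easy to test; an explicit `--agent` or `--agent-cmd` choice would presumably take precedence over whatever this returns.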
Marketplace & Publishing (v3.5.0)
- MarketplacePublisher — Publish skills to Claude Code plugin marketplace repos
- MarketplaceManager — Register and manage marketplace repositories
- ConfigPublisher — Push configs to registered config source repos
- `push_config` MCP tool — Automated config publishing
MCP Integration (40 Tools)
40 MCP tools across 10 categories for AI agents to prepare their own knowledge:
- Scraping tools (11) — All 18 source types accessible
- Config management, validation, packaging, installation
- Marketplace publishing and config pushing
- Supports Claude Code Desktop, Cursor, and other MCP-compatible agents
CLI Commands (27+)
Key commands:
- `create` — Unified entry point for all source types with auto-detection
- `doctor` (v3.5.0) — 8 diagnostic checks for troubleshooting
- `sync-config` (v3.3.0) — Crawl doc sites and sync URL lists
- `package` — Export to any platform with `--target`
- `install` / `install-agent` — Install to AI agent directories
- `workflows` — Manage enhancement workflow presets
Architecture Documentation (v3.4.0)
- 21 UML diagrams created with StarUML, synced from source code
- 14 class diagrams covering all modules
- 7 behavioral diagrams (sequence, activity, component)
- View Architecture Diagrams
- Full API Reference
Security (v3.5.0)
- Prompt injection detection — Bundled workflow scans scraped content for injection patterns
- defusedxml for XXE protection in sitemap parsing
- Path traversal validation for config names
- Auth token cleanup from cached Git configs
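
Path traversal validation for config names can be sketched as a two-step check: restrict the name to a safe character set, then confirm that the resolved path stays inside the config directory. The function name, the allowed character set, and the `.json` suffix below are assumptions, not the project's actual rules.

```python
import re
from pathlib import Path

# Assumed policy: start with a letter or digit, then letters, digits, '.', '_' or '-'.
SAFE_NAME = re.compile(r"[A-Za-z0-9][A-Za-z0-9._-]*")


def resolve_config_path(config_dir: Path, name: str) -> Path:
    """Map a config name to a file under config_dir, rejecting traversal attempts."""
    # Step 1: syntactic check. Rejects separators, empty names, and '..' sequences.
    if not SAFE_NAME.fullmatch(name) or ".." in name:
        raise ValueError(f"invalid config name: {name!r}")
    # Step 2: defense in depth. The resolved file must live under the base directory.
    base = config_dir.resolve()
    candidate = (base / f"{name}.json").resolve()
    if base not in candidate.parents:
        raise ValueError(f"config path escapes {base}: {name!r}")
    return candidate
```

The second check looks redundant next to the first, but it also covers cases the pattern cannot see, such as a symlink inside the config directory that points elsewhere.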
Quality & Testing
- 3194+ tests passing with 39 expected skips
- Comprehensive coverage across all 18 source types
- Integration tests with local HTTP servers
- Performance benchmarks with stabilized thresholds