| Developer | odysseynewmedia |
|---|---|
| Last updated | February 25, 2026, 23:11 |
| PHP version | 5.8 or higher |
| WordPress version | 6.9 |
| License | GPLv2 or later |
| License URI | License information |
An llms.txt file is generated instantly. No configuration is needed.
For Power Users: Manage every aspect of your AI strategy. Track bot traffic with built-in analytics, generate JSONL datasets for fine-tuning, and clean up your content with CSS selectors.
Concepts Explained: Why do you need this?
1. What is llms.txt?
Think of llms.txt as a "Sitemap for AI". While humans use HTML pages and Search Engines use XML sitemaps, AI agents look for an llms.txt file in your root directory. This file gives them a clean, prioritised list of links to crawl, ensuring they train on your best content and ignore the junk.
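As a rough illustration of what such a file can look like, here is a minimal sketch that renders an llms.txt document in the commonly used layout: a title heading, a short blockquote summary, then sections of Markdown links. The `Link` class, the `render_llms_txt` function, and the example URLs are hypothetical names made up for this sketch; the plugin's actual output may differ.

```python
from dataclasses import dataclass


@dataclass
class Link:
    """One entry in an llms.txt section (hypothetical helper type)."""
    title: str
    url: str
    note: str = ""


def render_llms_txt(site_name: str, summary: str, sections: dict[str, list[Link]]) -> str:
    """Render a title, a blockquote summary, and one link list per section."""
    lines = [f"# {site_name}", "", f"> {summary}", ""]
    for heading, links in sections.items():
        lines.append(f"## {heading}")
        lines.append("")
        for link in links:
            entry = f"- [{link.title}]({link.url})"
            if link.note:
                entry += f": {link.note}"
            lines.append(entry)
        lines.append("")
    # Collapse trailing blank lines into a single final newline.
    return "\n".join(lines).rstrip() + "\n"


if __name__ == "__main__":
    # Example data is invented purely for illustration.
    print(render_llms_txt(
        "Example Site",
        "A hypothetical site used only for illustration.",
        {"Posts": [Link("Getting started", "https://example.com/getting-started", "setup guide")]},
    ))
```

Listing the most important sections first is one simple way to express the "prioritised" ordering described above.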
2. What is llms-full.txt (Markdown)?
This is an optional advanced feature (RAG-Ready). Instead of just providing links, llms-full.txt provides your actual website content converted into clean, lightweight Markdown format.
* Why it's useful: It allows AI agents to ingest your website's knowledge immediately without needing to visit and scrape every single HTML page. This reduces server load and ensures the AI gets accurate data for "Retrieval Augmented Generation" (RAG).
* ⚠️ WARNING regarding Virtual Mode Limits: When using Virtual Mode to generate this file, the item limit for the llms-full.txt file is securely capped at 50 by default. Manually increasing this limit beyond 50 in the 'Tools' settings will drastically increase server load and risks causing immediate 500/503 server crashes. Use this feature at your own risk. If you require more than 50 items in your llms-full.txt file, we recommend using Physical Mode instead.
3. What is llms.jsonl (Fine-Tuning)?
This file formats your content into prompt-completion pairs (JSON Lines). This is the standard format used to fine-tune models like GPT-4 or Llama 3 on your specific data.
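To make the JSON Lines idea concrete, the sketch below writes one JSON object per line, each holding a prompt and a completion. The prompt template, the `to_jsonl` function, and the `title`/`content` field names are assumptions chosen for illustration; the plugin's actual field names and prompt wording are not documented here.

```python
import json


def to_jsonl(posts: list[dict]) -> str:
    """Serialise posts as JSON Lines: one prompt-completion object per line."""
    rows = []
    for post in posts:
        record = {
            # Assumed prompt template; a real export may phrase this differently.
            "prompt": f"Write an article titled: {post['title']}",
            "completion": post["content"],
        }
        # json.dumps escapes embedded newlines, so each record stays on one line.
        rows.append(json.dumps(record))
    return "\n".join(rows) + "\n"


if __name__ == "__main__":
    sample = [{"title": "Hello", "content": "First paragraph.\nSecond paragraph."}]
    print(to_jsonl(sample), end="")
```

Because every line is an independent JSON object, a consumer can stream the file line by line without loading it all into memory.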
New Features in 6.0:
* … .sidebar, .comments) to strip unwanted elements from your Markdown and JSONL files.
* … Allow or Disallow rules for individual AI crawlers (GPTBot, Google-Extended, etc.).
* … robots.txt.

* … odyssey-llms folder to the /wp-content/plugins/ directory.
* … llms.txt file has been generated. A new "Odyssey LLMS" menu item will appear in your admin sidebar.

The settings page is located in its own top-level menu in your WordPress admin sidebar, labelled Odyssey LLMS.
In the "Content Intelligence" tab, look for the "Content Cleaning (CSS)" field. Enter CSS selectors for elements you want to remove, separated by commas (e.g., .footer, .nav, #sidebar).
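The cleaning step can be pictured roughly as follows. This is a minimal sketch, not the plugin's implementation: it splits the comma-separated field, supports only simple selectors (`.class`, `#id`, or a bare tag name), assumes well-formed HTML with matching close tags, and drops comments. The names `parse_selectors`, `Stripper`, and `strip_elements` are hypothetical.

```python
from html.parser import HTMLParser

# Elements that never have a closing tag.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img",
             "input", "link", "meta", "source", "track", "wbr"}


def parse_selectors(field: str) -> list[str]:
    """Split a field such as '.footer, .nav, #sidebar' into selectors."""
    return [s.strip() for s in field.split(",") if s.strip()]


def matches(tag: str, attrs: dict, selector: str) -> bool:
    """Match simple selectors only: .class, #id, or a tag name."""
    if selector.startswith("."):
        return selector[1:] in (attrs.get("class") or "").split()
    if selector.startswith("#"):
        return attrs.get("id") == selector[1:]
    return tag == selector


class Stripper(HTMLParser):
    def __init__(self, selectors: list[str]):
        super().__init__(convert_charrefs=False)
        self.selectors = selectors
        self.out: list[str] = []
        self.skip_depth = 0  # > 0 while inside a removed element

    def _is_removed(self, tag, attrs) -> bool:
        return any(matches(tag, dict(attrs), s) for s in self.selectors)

    def handle_starttag(self, tag, attrs):
        if self.skip_depth:
            if tag not in VOID_TAGS:
                self.skip_depth += 1
        elif self._is_removed(tag, attrs):
            if tag not in VOID_TAGS:
                self.skip_depth = 1
        else:
            self.out.append(self.get_starttag_text())

    def handle_startendtag(self, tag, attrs):
        # Self-closing tags such as <br/> never change the nesting depth.
        if not self.skip_depth and not self._is_removed(tag, attrs):
            self.out.append(self.get_starttag_text())

    def handle_endtag(self, tag):
        if tag in VOID_TAGS:
            return
        if self.skip_depth:
            self.skip_depth -= 1
        else:
            self.out.append(f"</{tag}>")

    def handle_data(self, data):
        if not self.skip_depth:
            self.out.append(data)

    def handle_entityref(self, name):
        if not self.skip_depth:
            self.out.append(f"&{name};")

    def handle_charref(self, name):
        if not self.skip_depth:
            self.out.append(f"&#{name};")


def strip_elements(html: str, selector_field: str) -> str:
    """Return html with every element matching a selector removed."""
    parser = Stripper(parse_selectors(selector_field))
    parser.feed(html)
    parser.close()
    return "".join(parser.out)
```

For example, `strip_elements('<p>Keep</p><div class="nav">Menu</div>', ".footer, .nav, #sidebar")` leaves only the paragraph. A production cleaner would more likely rely on a full CSS selector engine than on this hand-rolled matcher.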
Go to the main settings tab and select "Virtual File" as your serving method. This allows WordPress to intercept bot requests and log them to your dashboard.
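Conceptually, serving the file virtually means each request passes through application code, which is what makes logging possible. The sketch below shows one way such logging could work by matching the User-Agent header against known bot tokens. The token list, `identify_bot`, and `log_hit` are assumptions for illustration only; the plugin's real detection list and storage are not described here.

```python
from collections import Counter

# Assumed list of user-agent substrings; not taken from the plugin.
AI_BOT_TOKENS = ("GPTBot", "ClaudeBot", "PerplexityBot")


def identify_bot(user_agent: str):
    """Return the first known bot token found in the User-Agent, else None."""
    ua = user_agent.lower()
    for token in AI_BOT_TOKENS:
        if token.lower() in ua:
            return token
    return None


def log_hit(stats: Counter, path: str, user_agent: str) -> bool:
    """Count the request if it came from a known bot; report whether it was logged."""
    bot = identify_bot(user_agent)
    if bot is None:
        return False
    stats[(bot, path)] += 1
    return True


if __name__ == "__main__":
    stats = Counter()
    log_hit(stats, "/llms.txt", "Mozilla/5.0 (compatible; GPTBot/1.0)")
    log_hit(stats, "/llms.txt", "Mozilla/5.0 (X11; Linux x86_64) Firefox/120.0")
    print(dict(stats))
```

A physically written file is served directly by the web server and never reaches code like this, which is consistent with analytics requiring the virtual serving method.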
No. The Robots.txt editor includes intelligent conflict detection. It will automatically fetch and import any virtual rules created by other SEO plugins so you don't lose them.
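The idea of preserving existing rules while adding per-crawler rules can be sketched as below. This is an illustrative approximation under stated assumptions, not the plugin's conflict-detection logic: `render_ai_rules` and `merge_rules` are hypothetical names, and the merge simply keeps existing rules untouched and appends the generated groups after them.

```python
def render_ai_rules(policies: dict[str, bool]) -> str:
    """Render one robots.txt group per crawler; True allows, False disallows."""
    blocks = []
    for agent, allowed in policies.items():
        rule = "Allow: /" if allowed else "Disallow: /"
        blocks.append(f"User-agent: {agent}\n{rule}")
    return "\n\n".join(blocks) + "\n"


def merge_rules(existing: str, generated: str) -> str:
    """Keep the existing rules intact and append the generated groups."""
    existing = existing.strip()
    return f"{existing}\n\n{generated}" if existing else generated


if __name__ == "__main__":
    # Rules that another plugin might already be serving (invented example).
    existing = "User-agent: *\nDisallow: /wp-admin/\n"
    generated = render_ai_rules({"GPTBot": False, "Google-Extended": True})
    print(merge_rules(existing, generated), end="")
```

Appending rather than rewriting means nothing that was already being served is lost, which is the behaviour the answer above describes.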
⚠️ WARNING: While you can manually increase the post/page limit for the llms-full.txt file in the settings, we strongly warn against setting it too high (especially in Virtual Mode). Doing so will drastically increase server load and risks causing immediate 500/503 server crashes due to the heavy processing required. If you require a large number of items, we recommend using Physical Mode.
* … llms-full.txt (Markdown) for richer context if available.
* … llms.jsonl).
* … llms-full.txt with full content converted to Markdown.
* … robots.txt. Includes logic to automatically fetch existing virtual rules (from Yoast/RankMath) to prevent data loss.
* … llms.txt for better AI context.
* … llms.txt virtually (required for Analytics) or physically (for performance).
* … llms.txt file is now correctly generated asynchronously immediately upon plugin activation.
* … Disallow rules for each individual AI crawler, offering fine-tuned control.
* … Sitemap: directive to the generated file for better crawler discovery if a sitemap URL is provided.
* … odyssey_llms_default_rawlers to allow developers to extend the default crawler list.
* … llms.txt format with structured sections and added a checklist of common AI crawlers.