TS - Robots.txt

Sources:


Overview

Next.js (App Router) lets you add a robots.ts file to the root of the app directory; at build time it generates a robots.txt file at the root of your site that tells search engine crawlers which URLs they may access.

Code Snippet

robots.ts:

// src/app/robots.ts
import type { MetadataRoute } from 'next'

export default function robots(): MetadataRoute.Robots {
  return {
    rules: {
      userAgent: '*',
      allow: '/',
      disallow: '/private/',
    },
    sitemap: 'https://acme.com/sitemap.xml',
    host: 'https://acme.com', // optional; assumed to match the sitemap origin above
  }
}
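
With this file in place, the generated robots.txt should look roughly like the following (the Host line appears only because a host value is returned above):

User-Agent: *
Allow: /
Disallow: /private/

Host: https://acme.com
Sitemap: https://acme.com/sitemap.xml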

To customize behavior for individual crawlers, return an array of rules instead:

import type { MetadataRoute } from 'next'

export default function robots(): MetadataRoute.Robots {
  return {
    rules: [
      {
        userAgent: 'Googlebot',
        allow: ['/'],
        crawlDelay: 10,
      },
      {
        userAgent: ['Applebot', 'Bingbot'],
        allow: ['/'],
      },
      {
        userAgent: 'GPTBot',
        disallow: ['/'],
      },
    ],
    sitemap: 'https://your-site.dev/sitemap.xml',
  }
}
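
Each entry in the rules array becomes its own block in the output, so the generated robots.txt for this example should look roughly like:

User-Agent: Googlebot
Allow: /
Crawl-delay: 10

User-Agent: Applebot
User-Agent: Bingbot
Allow: /

User-Agent: GPTBot
Disallow: /

Sitemap: https://your-site.dev/sitemap.xml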

Details

You can customize the rules for each individual web crawler, allowing or disallowing access as needed (a sketch combining these fields follows the list):

  1. userAgent: The name of the bot or web crawler (or an array of names) the rule applies to.
  2. allow: The paths or directories the crawler is allowed to request.
  3. disallow: The paths or directories the crawler must not request.
  4. crawlDelay: The number of seconds the crawler should wait between successive requests.
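
Tying the fields together, a single rule entry might look like this sketch; the bot name, paths, and delay are placeholder values, not anything taken from the examples above:

// A hypothetical rule entry illustrating each field; all values are placeholders.
const exampleRule = {
  userAgent: 'ExampleBot', // 1. which crawler the rule targets
  allow: ['/blog'],        // 2. paths this crawler may request
  disallow: ['/admin'],    // 3. paths this crawler must not request
  crawlDelay: 5,           // 4. seconds the crawler should wait between requests
}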

To verify that the robots.txt file is generated, visit http://localhost:3000/robots.txt in your browser while the development server is running.

See Also


Appendix

Note created on 2024-04-12 and last modified on 2024-04-12.

(c) No Clocks, LLC | 2024