Drupal Duplicate Content

Posted : admin On 1/25/2022
The Global Redirect module can painlessly eliminate duplicate content on your Drupal website
Duplicate content penalty

The Drupal module helps in creating the content duplication is Node clone. The node clone having the features below. The Node clone Drupal module allows users to make a copy of an existing item of site content (a node) and then edit that copy. The Node clone Drupal module will allow ownership to the current user who is duplicating the content.

A while back, Ben Finklea wrote about how easy it is to accidentally create duplicate content, and how to fix it. Though it has been some time since the post went up, it remains popular and useful advice— so we wanted to share it with you again.

Duplicate content isn’t just annoying and inconvenient— it’s bad for your SEO, and it’s very easy to accidentally create duplicate content on a Drupal site. But what is duplicate content? According to Google, duplicate content is “substantive blocks of content within or across domains that either completely match other content or are appreciably similar.”

(In layman’s terms, duplicate content is content on your website that’s really, really similar, if not identical, to other content on your website or even elsewhere on the web.)

  • Dec 19, 2016 To setup a Windows machine to run cron.php at a specific time follow the specific instructions below. This can be useful if you are not familiar with Linux/Unix, or if your web host does not offer the ability to run cron jobs; you can run them remotely from your own computer. Note: These instructions were written for Windows XP but should be similar in other versions of Windows.
  • Drupal (/ ˈ d r uː p əl /) is a free and open-source web content management framework (CMF) written in PHP and distributed under the GNU General Public License. Drupal provides a back-end framework for at least 12% of the top 10,000 websites worldwide – ranging from personal blogs to corporate, political, and government sites. Systems also use Drupal for knowledge management and for.
  • Mobile MVD is a traveling MVD office that assists customers with most MVD transactions. A list of Mobile MVD services is shown below. Current 2020 Mobile MVD schedule. There are no stops currently scheduled.

Some duplicate content is malicious: spammers stealing your website’s content and posting it as their own is malicious duplicate content, for example. However, the most duplicate content that we find (by far) is non-malicious duplicate content, such as:

  • Discussion forums that can generate both regular and stripped-down pages targeted at mobile devices.
  • Store items shown or linked via multiple distinct URLs
  • Printer-only versions of web pages

Unfortunately, when this happens, it can hurt your rankings: Google can’t figure out which duplicate content is the original source, and your content winds up competing against itself. For websites that have multiple pages with very similar content, there are certainly ways to indicate your preferred URL to Google— and for Drupal websites that have duplicate content, one of the best ways to fix the problem is by installing and using the Global Redirect module.

The Global Redirect Module

The Global Redirect module will take care of some housekeeping issues that come up when clean URLs are enabled in Drupal. In short, it’ll eliminate some of the duplicate content issues that you may not have known you had.

Here’s an example from Ben.

Drupal

'Let’s say, for example, that you create a new website and create the first node that you call the About Us page. Later, because you want the front page of your site to be the content of that node, you go into site settings and make node/1 the front page of the site. Sounds pretty harmless, right? Well, right at this moment, all of these URLs on your site would show the exact same content:

Duplicate Content Penalty

The search engines will think that you have six pages of the exact same content. That's never good. Global Redirect fixes that by redirecting all the URLs you don't want to the one URL that you do.'

The Global Redirect does a few neat tricks to make this happen. According to the module’s homepage on Drupal.org, it:

  • Checks the current URL for an alias and does a 301 redirect to it if it is not being used.
  • Checks the current URL for a trailing slash, removes it if present, and repeats check 1 with the new request.
  • Checks if the current URL is the same as the site_frontpage and redirects to the frontpage if there is a match.
  • Checks if the Clean URLs feature is enabled and then checks the current URL is being accessed using the clean method rather than the unclean method.
  • Checks access to the URL. If the user does not have access to the path, then no redirects are done. This helps avoid exposing private aliased nodes.
  • Make sure the case of the URL being accessed is the same as the one set by the author/administrator. For example, if you set the alias 'articles/cake-making' to node/123, then the user can access the alias with any combination of case.

For those who want to install it, but aren’t that technical, it’s pretty easy: download the module from https://www.drupal.org/project/globalredirect and configure by navigating to http://www.yourDrupalsite.com/settings/globalredirect or clicking on Admin Site configuration Global Redirect

Some of your options will include the following:

  • Deslash: Set to On. If enabled, this option will remove the trailing slash from requests. If you require certain requests to have a trailing slash, this feature can cause problems and so may need to be disabled— otherwise, leave it on.
  • Non-clean to Clean: Set to On. If enabled, this option will redirect from Non-clean to Clean URL (if Clean URL's are enabled). This will stop, for example, node 1 existing on both yourDrupalsite.com/node/1 and yourDrupalsite. com?q=node/1.
  • Remove Trailing Zero Argument: Set to Disabled. If enabled, any instance of /0 will be trimmed from the right of the URL. This stops duplicate pages such as taxonomy/ term/1 and taxonomy/term/1/0 where 0 is the default depth. There is an option of limiting this feature to taxonomy term pages only or allowing it to affect any page. By default this feature is disabled to avoid any unexpected behavior.
  • Menu Access Checking: Set to Disabled. If enabled, the module will check the user has access to the page before redirecting. This helps to stop redirection on protected pages and avoids giving away secret URL's. By default this feature is disabled to avoid any unexpected behavior.
  • Case Sensitive URL Checking: Set to Enabled. If enabled, the module will compare the current URL to the alias stored in the system. If there are any differences in case then the user will be redirected to the correct URL. Click Save configuration. Now your site is protected from duplicate content.

Best of luck to you in eliminating all of the duplicate content from your website. If you have problems with duplicate content, installing the Global Redirect module should help you rank higher in the SERPs relatively quickly; in the meantime, make sure you’re following SEO best practices. For advice or help getting your website sorted out, feel free to contact us. We’re always happy to lend a hand!

Drupal already has clean URLs and that is a big plus, you don’t need a special module for clean SEO friendly URLs. It needs a module for meta tags called node words but that can be easily added. But there are still some glitches that can be improved. The best way is to do it manually.

You know that SE don’t like duplicate content. And this is exactly what is happening inside Drupal site. With proper structure that is SEO friendly you can significantly increase the traffic to your site.

Drupal And Duplicate Content

For example your Drupal site can have 2 exact URLs, one points to a file and the other to directory.

http://mysite.net/articles/Drupal-SEO
http://mysite.net/articles/Drupal-SEO/

Drupal Duplicate Content

So on a normal Drupal site with clean URLs enabled Drupal will show the same content for both these links. This is useful because people can link to either one and not get a 404 error.

But regarding SEO, having 2 pages with same content might hurt you bad. This is known as duplicate content in SEO world.
Duplicate content can reduce your traffic. There is a simple solution for this Drupal SEO problem.

Redirecting Drupal with the .htaccess File

Every Drupal site has a .htaccess file that tell your server how to handle your sites URLs. So you need to just add 3 custom lines at the end of your .htaccess file so the server will automatically remove any trailing slashes in URLs.
#remove trailing slashes

RewriteCond %{HTTP_HOST} ^(www.)?seoandwebdesign.com$ [NC]
RewriteRule ^(.+)/$ http://%{HTTP_HOST}/$1 [R=301,L]

Drupal 8 duplicate content type

This is the code that works. The trailing trashes are removed from URLs. Instead of my site SEOandwebdesign.com use your URLs and also change the .com in the code if your site is .net or .org.

So the search engine bot will be redirected to the correct URL. This way you can protect your site against duplicate content.

Using robots.txt To Hide Duplicate Pages

We're not done yet, there's still some duplicate content to take care of. Here's our next example:

http://seoandwebdesign.com/articles/Drupal-SEO
http://seoandwebdesign.com/node/44

(I am using the URLs of my site seoandwebdesign.com that is actually a Joomla site not Drupal, so bare that in mind, replace my domain name with yours. )

Drupal duplicate content type

Again here are 2 URLs with the same content but different URLs. Google does not know that and your site will trigger Google duplicate content trigger. So each page with friendly URL has also a default node URL that has to be excluded from Google index.
To do that you just simply add this line to your robots.txt file.

Disallow: /node/

Not all the search engine bots will ignore all the node URLs and that will eliminate duplicate content problem.
But if there are some pages that do not have custom set URLs, and have only node URL, those pages will not be indexed at all. SO be careful to set each page with friendly URL and insert some keywords in URL. You can use pathauto module to automatically generate URLs.

Drupal Global Redirect Module

You can also use this module Global Redirect module that will forward all of your /node/ URLs to the proper alias, and it even removes trailing slashes for you. If you have recently started to use SEO friendly aliased URLs instead of simple node ones, this module is your best choice.

Drupal-cdn-redirect Duplicate Content Prevention

But if you have used aliased URLs from the start this won’t be an issue for your site. I prefer setting robots.txt and .htaccess files manually, and reduce the module server load.