
AI—From Start to Finish

It's common knowledge that AI feeds off the vastness of the World Wide Web. For AI, texts, photos, videos, and graphics are the source of "knowledge." ChatGPT and similar programs provide the answers. But what if the sources dry up?
Peter M. Färbinger, E3 Magazine
November 24, 2025

Although it has not been proven in all cases, the assumption seems obvious: despite virtual prohibition signs and paywalls, the operators of large language models (LLMs) are plundering the wealth of data on the internet. AI pioneers are among the world's best computer scientists, and for them, circumventing any hurdle or barrier is child’s play.

There is a WWW etiquette: a virtual entry ban for bots and crawlers can be declared at the beginning of a page's HTML code, typically as a robots meta tag in the document head. A web crawler, also called a spider or bot, is an automated program that searches the internet to collect and index content from websites. Crawlers follow hyperlinks to discover new web pages and store information such as titles, images, and keywords to create searchable indexes for search engines like Google and Bing. An entry ban can be useful for various reasons. For example, if a website is under construction and still contains test data, it makes little sense for a Google crawler to index these pages.
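To make the mechanism concrete, here is a minimal sketch of how a well-behaved crawler might read such a sign. It assumes the ban is expressed as a robots meta tag with the widely used `noindex` and `nofollow` values; the class name, the function name, and the sample page are hypothetical and serve only as illustration.

```python
from html.parser import HTMLParser


class RobotsMetaParser(HTMLParser):
    """Collects robots meta directives, the page title, and hyperlinks."""

    def __init__(self):
        super().__init__()
        self.directives = set()
        self.links = []
        self.title = ""
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "meta" and (attrs.get("name") or "").lower() == "robots":
            # The content attribute holds comma-separated directives.
            content = attrs.get("content") or ""
            self.directives.update(
                part.strip().lower() for part in content.split(",") if part.strip()
            )
        elif tag == "a" and attrs.get("href"):
            self.links.append(attrs["href"])
        elif tag == "title":
            self._in_title = True

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.title += data


def inspect_page(html):
    """Return what a polite crawler would conclude from one page."""
    parser = RobotsMetaParser()
    parser.feed(html)
    parser.close()
    banned = parser.directives
    return {
        "may_index": "noindex" not in banned and "none" not in banned,
        "may_follow": "nofollow" not in banned and "none" not in banned,
        "title": parser.title.strip(),
        "links": parser.links,
    }


# Hypothetical page under construction (sample data, not a real site).
SAMPLE_PAGE = """<html><head>
<title>Test page</title>
<meta name="robots" content="noindex, nofollow">
</head><body><a href="/draft.html">Draft</a></body></html>"""

result = inspect_page(SAMPLE_PAGE)
print(result)
```

The parser still sees the title and the link. Whether they are indexed or followed depends entirely on what the crawler does with `may_index` and `may_follow`.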

This prohibition sign for web crawlers can also be used to protect your own content. The prerequisite is compliance with WWW etiquette: the sign is a request, not a lock, and any such protection can be circumvented with more sophisticated programming. Numerous experiments show that the web crawlers of major IT companies regularly bypass virtual prohibition signs to train their LLMs.
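The voluntary nature of this etiquette can be shown in a short sketch. It assumes the sign is published as a `robots.txt` file, a common companion to the in-page tag that the article does not name explicitly, and it uses Python's standard `urllib.robotparser` module. The bot names, the rules, and the URLs are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: one AI crawler is banned from the whole site,
# all other crawlers are only kept out of the drafts folder.
ROBOTS_TXT = """\
User-agent: ExampleAIBot
Disallow: /

User-agent: *
Disallow: /drafts/
"""


def build_rules(robots_txt):
    """Parse robots.txt text into a rule set without any network access."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


def polite_fetch_allowed(rules, user_agent, url):
    """A polite crawler asks first; nothing forces it to call this."""
    return rules.can_fetch(user_agent, url)


rules = build_rules(ROBOTS_TXT)
ai_allowed = polite_fetch_allowed(
    rules, "ExampleAIBot", "https://example.com/article.html"
)
search_allowed = polite_fetch_allowed(
    rules, "ExampleSearchBot", "https://example.com/article.html"
)
draft_allowed = polite_fetch_allowed(
    rules, "ExampleSearchBot", "https://example.com/drafts/test.html"
)
print(ai_allowed, search_allowed, draft_allowed)
```

The check runs inside the crawler's own code. A crawler that skips the call, or announces itself under a different name, meets no technical resistance, which is exactly the weakness described above.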

Authors, journalists, artists, photographers, and all content producers consider this circumvention of a technical barrier to be copyright infringement and theft of intellectual property. There are preliminary legal opinions and court rulings on this issue in the US. In short, some US judges believe that these signs can be circumvented for AI training purposes. However, this does not mean that these texts and photos may be used in AI responses and results. This may be legally tenable, but it contradicts human sensibilities.

For training purposes, then, an AI may read E3 Magazine but may not quote it. A summary of E3 content is probably sufficient to help someone in the SAP community who is seeking assistance, and the AI can produce one easily from its training data. There is no need for verbatim quotes: the cat is out of the bag anyway, right?

Ultimately, it's a financial issue. Until now, anyone who used E3 content commercially had a business relationship with the publisher. This ensured the give-and-take that is so important in the SAP community, and it allowed new sources to emerge. However, if AI now plunders E3 sources without providing anything in return, there is a risk that E3 and many other independent SAP sources will dry up.

In a few years, only official SAP websites and the SAP User Group's web presence may be available for training large language models. The results will then be more modest.


Peter M. Färbinger, Publisher and Editor-in-Chief E3 Magazine DE, US and ES (e3mag.com), B4Bmedia.net AG, Freilassing (DE), E-Mail: pmf@b4bmedia.net and Tel. +49(0)8654/77130-21


