AI – from source to sink

It is common knowledge that AI feeds off the vastness of the World Wide Web. For AI, texts, photos, videos, and graphics are the source of "knowledge." The sink is the answers that ChatGPT and similar tools provide. What if the sources dry up?
Peter M. Färbinger, E3 Magazine
November 24, 2025
This text has been automatically translated from German to English.

It has not been proven in all cases, but the assumption seems obvious: despite virtual prohibition signs and paywalls, the operators of large language models (LLMs) are "plundering" the wealth of data on the internet. The AI pioneers are among the best computer scientists in the world, so it should be easy for them to circumvent any hurdle or barrier.

There is a WWW etiquette: a website can put up a virtual entry ban for bots and crawlers, typically as a robots.txt file in the root directory of the site or as a robots meta tag at the beginning of a page's HTML code. A web crawler is an automated program (also called a spider or bot) that searches the internet to collect and index content from websites. It follows hyperlinks to discover new web pages and stores information such as titles, images, and keywords to create a searchable index for search engines such as Google or Bing. The barrier can be very useful for various reasons: if, for example, a website is under construction and still contains test data, it makes little sense for a Google crawler to index these pages.
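To make the crawler description more concrete, here is a minimal sketch in Python, not any search engine's actual implementation. It reads one page, stores the title, collects the hyperlinks it could follow next, and looks for a robots meta tag at the beginning of the HTML code. The class name `PageIndexer`, the sample page, and the example.com URLs are assumptions made up for illustration.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class PageIndexer(HTMLParser):
    """Collects title, outgoing links, and robots meta directive of one page."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.title = ""
        self.links = []
        self.robots = ""  # content of <meta name="robots">, if present
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "title":
            self._in_title = True
        elif tag == "a" and attrs.get("href"):
            # Resolve relative links against the page URL
            self.links.append(urljoin(self.base_url, attrs["href"]))
        elif tag == "meta" and (attrs.get("name") or "").lower() == "robots":
            self.robots = (attrs.get("content") or "").lower()

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.title += data


# Hypothetical page that is still under construction
SAMPLE_HTML = """<html><head>
<meta name="robots" content="noindex, nofollow">
<title>Test page</title></head>
<body><a href="/about">About</a> <a href="https://example.org/x">X</a></body></html>"""

page = PageIndexer("https://example.com/")
page.feed(SAMPLE_HTML)

# A well-behaved crawler honors these directives; nothing forces it to.
may_index = "noindex" not in page.robots
may_follow = "nofollow" not in page.robots

print(page.title)   # Test page
print(page.links)   # ['https://example.com/about', 'https://example.org/x']
print(may_index, may_follow)  # False False
```

The parser finds the links regardless of the meta tag. Whether the crawler then indexes the page or follows those links is a decision made in the crawler's own code.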

Naturally, this prohibition sign for web crawlers can also be used to protect your own content. The prerequisite is, of course, that the crawler complies with WWW etiquette: the sign is a request, not a lock, and any technical protection beyond it can be circumvented with even more sophisticated programming. Numerous experiments suggest that the web crawlers of the major AI pioneers regularly bypass the virtual prohibition signs to collect training data for their LLMs.
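The dependence on etiquette can be illustrated with Python's standard `urllib.robotparser` module. The sketch below assumes a hypothetical robots.txt that bans a made-up crawler called `ExampleAIBot` from the whole site and all other bots from a test area; the bot names and URLs are invented for this example.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt of a publisher's website
ROBOTS_TXT = """\
User-agent: ExampleAIBot
Disallow: /

User-agent: *
Disallow: /test/
"""

rules = RobotFileParser()
rules.parse(ROBOTS_TXT.splitlines())


def may_fetch(agent, url):
    """Return True if the robots.txt rules allow this agent to fetch the URL."""
    return rules.can_fetch(agent, url)


print(may_fetch("ExampleAIBot", "https://example.com/article"))      # False
print(may_fetch("SomeSearchBot", "https://example.com/article"))     # True
print(may_fetch("SomeSearchBot", "https://example.com/test/draft"))  # False
```

The check runs inside the crawler itself. A crawler that never calls `may_fetch`, or that ignores a `False` result, can still request the page, which is why the prohibition sign only protects content from bots that choose to respect it.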

Authors, journalists, artists, photographers, and all content producers consider this circumvention of a technical barrier to be a copyright infringement and theft of intellectual property. There are preliminary legal opinions and court rulings on this issue in the US. In short, some US judges believe that the prohibition signs can be circumvented for the purpose of AI training. However, this does not mean that these texts and photos may be used in AI responses and results. It is a fine line that may be legally tenable, but it contradicts human sensibilities.

So, for training purposes, the AI is allowed to read E3 magazines, but it is not allowed to quote them. A good summary from E3 is probably enough to help someone seeking assistance from the SAP community, and the AI can certainly produce one very well from the "training data." There is no need for the luxury of a verbatim quote. The cat is out of the bag anyway, right?

Ultimately, it is a financial problem: until now, whoever used E3 content commercially had a business relationship with the publisher. This ensured the all-important give and take in the SAP community, and it allowed new sources to emerge. If AI now "plunders" E3 sources without providing anything in return, there is a risk that E3 and many other independent SAP sources will dry up.

In a few years, only the official SAP websites and the user group's web offering may be available to AI for training large language models. The answers at the sink will then be more modest. (pmf)

Peter M. Färbinger, Publisher and Editor-in-Chief E3 Magazine DE, US and ES (e3mag.com), B4Bmedia.net AG, Freilassing (DE), E-Mail: pmf@b4bmedia.net and Tel. +49(0)8654/77130-21


Working on the SAP Basis is crucial for a successful S/4 conversion.

This gives the Competence Center strategic importance for existing SAP customers. Regardless of the S/4 Hana operating model, topics such as automation, monitoring, security, application lifecycle management, and data management form the basis for S/4 operations.

For the fourth time, E3 magazine is organizing a summit for the SAP community in Salzburg to provide comprehensive information on all aspects of S/4 Hana groundwork.

Venue

FourSide Hotel Salzburg,
Trademark Collection by Wyndham
Am Messezentrum 2, 5020 Salzburg, Austria
+43-66-24355460

Event date

Wednesday, June 10, and
Thursday, June 11, 2026

Tickets

Early Bird Ticket: EUR 390 excl. VAT, available until November 30, 2025
Regular ticket: EUR 590 excl. VAT
Subscribers to the E3 magazine: EUR 390 excl. VAT, reduced with promocode CCAbo26
Students*: EUR 290 excl. VAT, reduced with promocode CCStud26. Please send proof of studies by e-mail to office@b4bmedia.net.

*The first 10 tickets are free of charge for students. Try your luck! 🍀

Venue

Hotel Hilton Heidelberg
Kurfürstenanlage 1
D-69115 Heidelberg

Event date

Wednesday, April 22 and
Thursday, April 23, 2026

Tickets

Early Bird Ticket: EUR 390 excl. VAT, available until November 30, 2025
Regular ticket: EUR 590 excl. VAT
Subscribers to the E3 magazine: EUR 390 excl. VAT, reduced with promocode STAbo26
Students*: EUR 290 excl. VAT, reduced with promocode STStud26. Please send proof of studies by e-mail to office@b4bmedia.net.

*The first 10 tickets are free of charge for students. Try your luck! 🍀
The event is organized by E3 magazine, which is published by B4Bmedia.net AG. The presentations will be accompanied by an exhibition of selected SAP partners. The ticket price includes attendance at all presentations of the Steampunk and BTP Summit 2026, a visit to the exhibition area, participation in the evening event, and catering during the official program. The lecture program and the list of exhibitors and sponsors (SAP partners) will be published on this website in due course.