From Startup to Exit

Legalities of scraping data to build LLMs, a conversation w/ Adam Shevell, partner at Wilson Sonsini

TiE Seattle Season 1 Episode 16

Many of the large language model providers have built their LLM models by scraping data from websites and open source sites. Some have licensed the content like Open AI has done with Reddit. Nonetheless, many of these LLM providers have been sued for illegally copying data for which they have no permission. In this podcast, Adam shares his thoughts on the Fair Use doctrine and various legal opinions on the legality of scraping data to build LLM models. Here's also an interesting article that Adam has written for developing, extending and using Generative AI models.

Adam Shevell is a partner in Wilson Sonsini’s San Francisco office, where he co-leads the firm’s technology transactions practice in the city. Adam advises technology companies and their investors at all stages of company development, from pioneering start-ups to leading global enterprises, angel investors, venture capital firms, and other institutions in the start-up ecosystem. 

Adam represents leading Silicon Valley companies on complex and strategic transactions involving cutting-edge innovations and the launch of new products. By understanding his clients’ products, markets, and business priorities, and by building deep and lasting relationships, Adam provides creative and pragmatic advice focused on delivering effective results.

Adam also works closely with Canadian start-ups on their U.S. expansion, fundraising, strategic partnerships, and exit transactions, and with Canadian venture funds investing in U.S.-based companies.

Brought to you by TiE Seattle
Hosts: Shirish Nadkarni and Gowri Shankar
Producers: Minee Verma and Eesha Jain
YouTube Channel: https://www.youtube.com/@tieseattle567

People on this episode