Shutterstock pioneers 'research license' model with Lightricks, lowering barriers to AI training data

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More


Shutterstock is reshaping how AI companies access training data through a novel “research license” approach, launching first with AI creative technology company Lightricks. The partnership, announced today, allows Lightricks to train its open-source video generation model LTXV using Shutterstock’s extensive HD and 4K video library.

The new licensing model addresses a critical challenge in AI development: the high cost of accessing quality training data. It enables companies to start with a smaller research license for testing and experimentation before committing to more expensive commercial licenses.

Making ethical AI development more accessible for startups

“Many companies and model trainers have taken the route of unauthorized data scraping [instead of] making the necessary investment to achieve the quality and level of trust needed to develop commercially viable models,” said Daniel Mandell, Shutterstock’s global head of data licensing & AI, in an exclusive interview with VentureBeat. “However, we don’t think that financial investment should be a barrier for those looking to enter this space with an ethical approach.”

This two-phase approach could transform how startups approach AI development. Craig Andrews, Lightricks’ global PR manager, describes it as “a turning point for smaller, more agile developers who want to explore innovative applications of generative AI without the heavy upfront costs of traditional licensing.”

The timing is significant, coming amid increasing legal scrutiny of AI training data practices. Several major AI companies face lawsuits over alleged unauthorized use of copyrighted material for model training. Shutterstock’s approach offers a legitimate alternative while ensuring content creators receive compensation.

“We’re setting a standard for ethical AI development while ensuring that creators are fairly compensated for their work,” Andrews explains. “This approach not only fosters trust in the creative ecosystem but also establishes a sustainable framework for responsible AI innovation.”

Revenue sharing: A win-win for creators and AI companies

Shutterstock has implemented a revenue-sharing model where contributors receive 20% of the revenue from data licensing deals. Contributors can also opt out of having their content used for AI training, though Mandell notes only about 1% have chosen to do so.

Lightricks plans to use the licensed video data to enhance LTXV, its open-source video generation model released last month. The model has already gained significant traction, with “thousands of downloads on GitHub and Hugging Face,” according to Andrews. One notable use case is real-time video generation for interactive ecommerce.

The partnership aims to address technical challenges in AI video generation, particularly motion consistency in longer videos. “One of the biggest technical hurdles in AI video generation is achieving consistent motion and structure over longer video segments without sacrificing quality,” Andrews says. “Shutterstock’s high-quality video library provides an extensive dataset that helps us address this challenge.”

For Shutterstock, this partnership represents a strategic shift in its business model. The company has already established partnerships with major AI companies including Nvidia, Meta, and OpenAI. Mandell emphasizes that the research license model could democratize access to high-quality training data for smaller organizations and research institutions.

Setting new industry standards for ethical AI development

The collaboration also reflects a growing trend toward transparency and ethical considerations in AI development. Lightricks made LTXV open-source to promote collaboration and innovation, while Shutterstock’s licensing approach ensures proper compensation for content creators.

“The important message here is that companies, no matter the size or funding, no longer have an excuse to scrape unlicensed content for training purposes,” Mandell concludes. “There is a better way to enter this evolving market.”

This partnership could set a new standard for how AI companies access training data, potentially influencing industry practices as concerns about the sources of AI training data continue to grow. The success of this model could determine whether other content providers follow Shutterstock’s lead in creating more flexible, accessible licensing options for AI development.



Source link