Opening Cross: Big Data, Bigger Data Licensing Challenges
Unauthorized redistribution of data remains a major issue for the market data industry, whether it’s fly-by-night data vendors in distant jurisdictions scraping data from legitimate vendors’ feeds; trading firms deliberately under-reporting usage to keep licensing costs down and hoping that they won’t get audited, or unwittingly allowing data to flow to end-users or clients that command a different tier of fees; or companies creating structured products based on indexes that they haven’t acquired a license to use.
In the case of newswires and news media such as ourselves, our data asset is our news. And while we expect to find the occasional rogue website blatantly copying our copy, it’s unusual to find examples of this from companies familiar with the data industry and its licensing issues. Yet we encountered exactly such an instance last week, when a company copied and pasted an entire story onto its website and linked to it in an email blast. Other news sites then picked up the story, believing it to be a legitimate press release, and posted it word-for-word on their sites. One of these was, to say the least, miffed when we explained the situation, because it had unwittingly been exposed to plagiarism, and it immediately removed the story.
For the record, we do offer a range of choices for anyone who just can’t live without their own personal copy of an IMD story (for a fee, of course). We can produce custom PDFs of individual stories, either for printed materials or for posting on websites, and we also offer the option of paying to make a story freely available to all visitors to our site. In fact, to date we have only produced reprints in PDF format (though we now also offer a more affordable text-only reprint license), so for now, if you see the text of an IMD story posted elsewhere in any other format, you can draw your own conclusions about that company’s policies. And for those who can’t afford (or resent) the fees, we encourage companies featured in our stories to link to our website.
Like any data provider or consumer, we take protecting our premium content very seriously. Of course, there are other instances, such as people posting stories with a broader public interest on discussion sites, or the technology provider with a business line devoted to monitoring data consumption that seemingly obliviously re-posts our stories, or the desktop hardware provider that has a poorly scanned version of one of our stories on its site (originally a press clipping provided by its then-PR agency). It shocks me that anyone operating within our industry is unaware of, or willfully ignores, the fact that content is content, and that the restrictions that apply to one type of premium content apply equally to other premium content sets.
Thankfully, this view is rare today, and most responsible vendors and consumers take compliance very seriously. Hence, McGraw-Hill Financial’s Content Acquisition and Strategic Alliances group—which recently hired several seasoned industry players—includes a function for managing compliance with contracts and licenses from the third-party data suppliers that it uses in-house to produce ratings, research and analysis, and those whose content forms part of its services.
However, the rise of Big Data analysis and growing volumes of unstructured data may make it harder for the industry to track and reconcile data use as efficiently as before. For example, tracking signals created from unstructured social media data (such as those being combined with structured data by analysis platform provider AnalytixInsight), or the wealth of contextual text-based information being exposed by Perfect Information’s new Filings Expert tool, may require more sophisticated monitoring tools such as the market surveillance application being rolled out by Software AG.
How successfully the industry can manage and track this data may play a role in how widely and quickly it is adopted in trading environments, and in whether its value can translate from promise to profit.
Copyright Infopro Digital Limited. All rights reserved.