Media News

BloombergGPT aims to be AI for business news

Joshua Benton of Nieman Lab writes about BloombergGPT, which aims to be a domain-specific artificial intelligence for business news.

Benton writes, “How big is BloombergGPT? Well, the company says it was trained on a corpus of more than 700 billion tokens (or word fragments). For context, GPT-3, released in 2020, was trained on about 500 billion. (OpenAI has declined to reveal any equivalent number for GPT-4, the successor released last month, citing ‘the competitive landscape.’)

“What’s in all that training data? Of the 700 million-plus tokens, 363 billion are taken from Bloomberg’s own financial data, the sort of information that powers its terminals — ‘the largest domain-specific dataset yet’ constructed, it says. Another 345 billion tokens come from ‘general purpose datasets’ obtained from elsewhere.

“The company-specific data, named FinPile, consists of ‘a range of English financial documents including news, filings, press releases, web-scraped financial documents, and social media drawn from the Bloomberg archives.’ So if you’ve read a Bloomberg Businessweek story in the past few years, it’s in there. So are SEC filings, Bloomberg TV transcripts, Fed data, and ‘other data relevant to the financial markets.'”

Read more here.

Chris Roush

Chris Roush was the dean of the School of Communications at Quinnipiac University in Hamden, Connecticut. He was previously Walter E. Hussman Sr. Distinguished Professor in business journalism at UNC-Chapel Hill. He is a former business journalist for Bloomberg News, Businessweek, The Atlanta Journal-Constitution, The Tampa Tribune and the Sarasota Herald-Tribune. He is the author of the leading business reporting textbook "Show me the Money: Writing Business and Economics Stories for Mass Communication" and "Thinking Things Over," a biography of former Wall Street Journal editor Vermont Royster.

Recent Posts

LinkedIn finance editor Singh departs

Manas Pratap Singh, finance editor for LinkedIn News Europe, has left for a new opportunity…

19 hours ago

Washington Post announces start of third newsroom

Washington Post executive editor Matt Murray sent out the following on Friday: Dear All, Over the last…

2 days ago

FT hires Moens to cover competition and tech in Brussels

The Financial Times has hired Barbara Moens to cover competition and tech in Brussels. She will start…

2 days ago

Deputy tech editor Haselton departs CNBC for The Verge

CNBC.com deputy technology editor Todd Haselton is leaving the news organization for a job at The Verge.…

2 days ago

“Power Lunch” co-anchor Tyler Mathisen is leaving CNBC

Note from CNBC Business News senior vice president Dan Colarusso: After more than 27 years…

2 days ago

Upset CoinDesk staffers send letter to owner

Members of the CoinDesk editorial team have sent a letter to the CEO of its…

2 days ago