Wikipedia to Soon Launch a Speech Engine for Visually Impaired Users

PUBLISHED BY
Carrie Ann



5 years ago




In an effort to make Wikipedia more accessible to visually impaired users, the online encyclopedia has teamed up with researchers from Sweden’s KTH Royal Institute of Technology to develop the world’s first crowdsourced speech engine.

The speech synthesis platform will be optimized for Wikipedia, but it will also be released as open source software for any site running MediaWiki, the open source wiki package.

The Wikispeech pilot project will initially be developed in English, Swedish and Arabic and is set to be completed by September 2017, after which the service will be extended to the rest of the 280 languages in which Wikipedia is currently available.

According to Joakim Gustafson, professor of speech technology at KTH, the initial focus will be on the Swedish language, followed by a rudimentary English voice, which the team expects to build quickly given the large amount of open source linguistic resources available. Last, they plan to build a basic Arabic voice, which will be more of a proof of concept.

Like the encyclopedia itself, the speech engine will be crowdsourced. Researchers at KTH will rely on contributions from the online community to develop the platform. The content generated will be open and freely licensed to everyone, in accordance with the rules of Wikimedia Commons.

Wikispeech is a partnership between KTH Royal Institute of Technology, the Swedish Post and Telecom Authority (PTS), Wikimedia Sweden and STTS speech technology services. Jonas Beskow, professor of speech communication at KTH, and Zofia Malisz will head the project.

According to PTS, the Swedish telecommunications regulator that is funding the project, about 25 percent of Wikipedia users, or roughly 125 million people per month, prefer to access its content in spoken form.

According to Gustafson, an open source module will be created so that any open source speech synthesizer can be plugged in. Because the framework is open, it will be easy to add or substitute individual modules in the text-to-speech (TTS) system. The open source TTS functionality could be used by anybody for any purpose, not just reading web pages, he added.
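The plug-in idea Gustafson describes can be illustrated with a minimal Python sketch. It is not Wikispeech code: the `SpeechPipeline` class, its `register` and `speak` methods, and the `dummy_synth` backend are all hypothetical names, and the stand-in backend returns the text as bytes rather than real audio.

```python
from typing import Callable, Dict

# Assumption: a synthesizer backend is any function that maps text to audio bytes.
Synthesizer = Callable[[str], bytes]


class SpeechPipeline:
    """Hypothetical registry that lets synthesizer backends be swapped by name."""

    def __init__(self) -> None:
        self._backends: Dict[str, Synthesizer] = {}

    def register(self, name: str, backend: Synthesizer) -> None:
        # Registering under an existing name substitutes the old backend.
        self._backends[name] = backend

    def speak(self, text: str, backend: str) -> bytes:
        if backend not in self._backends:
            raise KeyError(f"no synthesizer registered as {backend!r}")
        return self._backends[backend](text)


def dummy_synth(text: str) -> bytes:
    # Stand-in for a real engine: returns the UTF-8 text instead of audio.
    return text.encode("utf-8")


pipeline = SpeechPipeline()
pipeline.register("dummy", dummy_synth)
audio = pipeline.speak("Hello, Wikipedia", backend="dummy")
```

Because callers only name the backend they want, a different open source synthesizer could be registered under the same name without changing the code that requests speech.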

The researchers also want to look into the possibility of letting users record how a word should actually be pronounced and then have the transcription corrected automatically. Initially, corrections to the dictionary will have to be entered as phonetic transcription (IPA), Gustafson said.
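As a rough illustration of correcting a pronunciation dictionary with IPA, the Python sketch below layers user-submitted transcriptions over a base lexicon. The `PronunciationLexicon` class, its methods and the deliberately wrong base entry are assumptions made for the example, not part of the Wikispeech project.

```python
from typing import Dict, Optional


class PronunciationLexicon:
    """Hypothetical lexicon: community corrections take priority over base entries."""

    def __init__(self, entries: Dict[str, str]) -> None:
        # Keys are lowercased so lookups are case-insensitive.
        self._entries = {word.lower(): ipa for word, ipa in entries.items()}
        self._corrections: Dict[str, str] = {}

    def correct(self, word: str, ipa: str) -> None:
        # Store the user-supplied IPA transcription without touching the base entry.
        self._corrections[word.lower()] = ipa

    def lookup(self, word: str) -> Optional[str]:
        key = word.lower()
        if key in self._corrections:
            return self._corrections[key]
        return self._entries.get(key)


# The base entry below is intentionally wrong, to show a correction taking effect.
lexicon = PronunciationLexicon({"Wikipedia": "wɪkɪpɛdɪa"})
before = lexicon.lookup("wikipedia")
lexicon.correct("Wikipedia", "ˌwɪkɪˈpiːdiə")
after = lexicon.lookup("wikipedia")
```

Keeping corrections in a separate layer means a mistaken submission can be dropped without losing the original dictionary entry.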

The eventual aim is for Wikipedia to deliver its content in spoken form in every language in which it is available.

Carrie Ann
Carrie Ann is Editor-in-Chief at Industry Leaders Magazine, based in Las Vegas. Carrie covers technology, trends, marketing, brands, productivity, and leadership. When she isn’t writing, she reads books and articles on business, economics, corporate law, luxury products, artificial intelligence, and the latest technology. She’s keen on political discussions and has an undying passion for gadgets.
