Sunday, April 2, 2023
SCIENMAG: Latest Science and Health News
No Result
View All Result
  • Login
  • HOME PAGE
  • BIOLOGY
  • CHEMISTRY AND PHYSICS
  • MEDICINE
    • Cancer
    • Infectious Emerging Diseases
  • SPACE
  • TECHNOLOGY
  • CONTACT US
  • HOME PAGE
  • BIOLOGY
  • CHEMISTRY AND PHYSICS
  • MEDICINE
    • Cancer
    • Infectious Emerging Diseases
  • SPACE
  • TECHNOLOGY
  • CONTACT US
No Result
View All Result
Scienmag - Latest science news from science magazine
No Result
View All Result
Home SCIENCE NEWS Mathematics

A new and better way to create word lists

March 13, 2023
in Mathematics
0
Share on FacebookShare on Twitter

Word lists are the basis of so much research in so many fields. Researchers at the Complexity Science Hub have now developed an algorithm that can be applied to different languages and can expand word lists significantly better than others.
 

Expanding word lists in a more accurate way

Credit: © Complexity Science Hub

Word lists are the basis of so much research in so many fields. Researchers at the Complexity Science Hub have now developed an algorithm that can be applied to different languages and can expand word lists significantly better than others.
 

Many projects start with the creation of a word list. Not only in companies when mind maps are created, but also in all areas of research. Imagine you want to find out on which days people are in a particularly good mood by analyzing Twitter postings. Just looking for the word “happy” wouldn’t be enough. 

Instead, you would have to use an algorithm that detects all tweets that indicate that someone is happy. “So the first step is to create a list of all the words that indicate just that. The whole research stands or falls on doing so,” explains Anna Di Natale, a researcher at the Complexity Science Hub in Vienna. But how to come up with the most accurate, complete word lists possible? 

A PROBLEM THAT CONCERNS MANY

This widespread problem not only concerns opinion researchers who want to find out how politicians’ statements are received by the public. Companies, too want to find out how their products are perceived through sentiment analysis. 

To improve things, Di Natale has now developed a new method, called LEXpander, that outperforms previous algorithms. And this even in two different languages – German and English. Moreover, for the very first time ever, she has developed a way through which it is possible to compare different tools at all.

IMPROVED PERFORMANCE

In comparison with four other algorithms for wordlist expansion (WordNet, Empath 2.0, FastText and GloVe), LEXpander performed significantly better, especially in German. For example, the researchers found that LEXpander guesses 43% of words right when expanding an English word list for positive meaning. A very popular model, FastText, in comparison, is right only 28% of the time. 

INDEPENDENCE FROM THE LANGUAGE ITSELF

The reason is that this tool works language-independently. It is not based on one language, but on a so-called colexification network. This recognized linguistic concept resides on homonyms and polysemies, single words that have two or more distinct meanings. For example: the ancient Greek word φάρμακον (pharmacon) can mean medicine or poison. Two different things, but thematically close. But there are others that don’t suggest kinship – such as “bank” as a financial institution or the land alongside a river. 

“If you collect them across many languages – and here we analyzed about 19 different languages – you can see connections between them,” Di Natale says. The network is formed when these colexifications occur in several languages across different language families, creating connections.

This independence from the language itself allows LEXpander to achieve better results in different languages. “There are many methods developed for English. They work very well and quickly and everyone uses them. Trying to apply them to other languages works, but not as well as it might work if you had started developing a method for German or Italian,” Di Natale explains. 

IMPORTANT FOR NEW TOPICS LIKE COVID

For many topics there are already good word lists. But for new topics – like when COVID came up – new ones have to be created. Until now, they were usually created by hand during brainstorming with colleagues and several tools were used to help. But until now there was no way to compare them. Anna Di Natale and her team have now created this possibility and have also developed a new tool that performs better than the others. This can be an important cornerstone for many future research projects in various fields.

 

 

FIND OUT MORE

The study “LEXpander: Applying colexification networks to automated lexicon expansion” has been published in Behavior Research Methods.

ABOUT THE COMPLEXITY SCIENCE HUB

The mission of the Complexity Science Hub (CSH Vienna) is to host, educate, and inspire complex systems scientists dedicated to making sense of Big Data to boost science and society. Scientists at the Complexity Science Hub develop methods for the scientific, quantitative, and predictive understanding of complex systems.

The CSH Vienna is a joint initiative of AIT Austrian Institute of Technology, Central European University CEU, Danube University Krems, Graz University of Technology, IIASA, Medical University of Vienna, TU Wien, VetMedUni Vienna, Vienna University of Economics and Business, and Austrian Economic Chambers (WKO). https://www.csh.ac.at

 



Journal

Behavior Research Methods

DOI

10.3758/s13428-023-02063-y

Method of Research

Data/statistical analysis

Subject of Research

People

Article Title

LEXpander: Applying colexification networks to automated lexicon expansion

Article Publication Date

10-Mar-2023

Tags: createlistsword
Share26Tweet16Share4ShareSendShare
  • Thrushes

    A final present from birds killed in window collisions: poop that reveals their microbiomes

    81 shares
    Share 32 Tweet 20
  • Why are forests turning brown in summer?

    66 shares
    Share 26 Tweet 17
  • Professor Yasmine Belkaid appointed Institut Pasteur President

    66 shares
    Share 26 Tweet 17
  • Conversion to Open Access using equitable new model sees upsurge in usage of expert scientific knowledge

    68 shares
    Share 27 Tweet 17
  • New, exhaustive study probes hidden history of horses in the American West

    65 shares
    Share 26 Tweet 16
  • Null results research now published by major behavioral medicine journal

    651 shares
    Share 260 Tweet 163
ADVERTISEMENT

About us

We bring you the latest science news from best research centers and universities around the world. Check our website.

Latest NEWS

A final present from birds killed in window collisions: poop that reveals their microbiomes

Null results research now published by major behavioral medicine journal

The “Stonehenge calendar” shown to be a modern construct

Subscribe to Blog via Email

Enter your email address to subscribe to this blog and receive notifications of new posts by email.

Join 205 other subscribers

© 2023 Scienmag- Science Magazine: Latest Science News.

No Result
View All Result
  • HOME PAGE
  • BIOLOGY
  • CHEMISTRY AND PHYSICS
  • MEDICINE
    • Cancer
    • Infectious Emerging Diseases
  • SPACE
  • TECHNOLOGY
  • CONTACT US

© 2023 Scienmag- Science Magazine: Latest Science News.

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In