An Overview of Social Media and Sentiment Analysis

Ieee account.

  • Change Username/Password
  • Update Address

Purchase Details

  • Payment Options
  • Order History
  • View Purchased Documents

Profile Information

  • Communications Preferences
  • Profession and Education
  • Technical Interests
  • US & Canada: +1 800 678 4333
  • Worldwide: +1 732 981 0060
  • Contact & Support
  • About IEEE Xplore
  • Accessibility
  • Terms of Use
  • Nondiscrimination Policy
  • Privacy & Opting Out of Cookies

A not-for-profit organization, IEEE is the world's largest technical professional organization dedicated to advancing technology for the benefit of humanity. © Copyright 2024 IEEE - All rights reserved. Use of this web site signifies your agreement to the terms and conditions.

Numbers, Facts and Trends Shaping Your World

Read our research on:

Full Topic List

Regions & Countries

  • Publications
  • Our Methods
  • Short Reads
  • Tools & Resources

Read Our Research On:

  • Americans’ Changing Relationship With Local News

As news consumption habits become more digital, U.S. adults continue to see value in local outlets

Table of contents.

  • 1. Attention to local news
  • 2. Local news topics
  • Americans’ changing local news providers
  • How people feel about their local news media’s performance
  • Most Americans think local journalists are in touch with their communities
  • Interactions with local journalists
  • 5. Americans’ views on the financial health of local news
  • Acknowledgments
  • The American Trends Panel survey methodology

Reporters question a defense attorney at Harris County Criminal Courts at Law in Houston on March 26, 2024. (Yi-Chin Lee/Houston Chronicle via Getty Images)

The Pew-Knight Initiative supports new research on how Americans absorb civic information, form beliefs and identities, and engage in their communities.

Pew Research Center is a nonpartisan fact tank that informs the public about the issues, attitudes and trends shaping the world. Knight Foundation is a social investor committed to supporting informed and engaged communities. Learn more >

Pew Research Center conducted this study to better understand the local news habits and attitudes of U.S. adults. It is a follow-up to a similar study conducted in 2018 .

The survey of 5,146 U.S. adults was conducted from Jan. 22 to 28, 2024. Everyone who completed the survey is a member of the Center’s American Trends Panel (ATP), an online survey panel that is recruited through national, random sampling of residential addresses. This way nearly all U.S. adults have a chance of selection. The survey is weighted to be representative of the U.S. adult population by gender, race, ethnicity, partisan affiliation, education and other categories.  Read more about the ATP’s methodology .

Refer to the topline for the questions used for this survey , along with responses, and to the methodology for more details.

This is a Pew Research Center report from the Pew-Knight Initiative, a research program funded jointly by The Pew Charitable Trusts and the John S. and James L. Knight Foundation. Find related reports online at https://www.pewresearch.org/pew-knight/ .

The local news landscape in America is going through profound changes as both news consumers and producers continue to adapt to a more digital news environment. We recently asked U.S. adults about the ways they access local news, as well as their attitudes toward local journalism, finding that:

A bar chart showing Americans increasingly prefer digital pathways to local news

  • A growing share of Americans prefer to get local news online, while fewer are getting news on TV or in print. And newspapers are no longer primarily consumed as a print product – the majority of readers of local daily newspapers now access them digitally.
  • The share of U.S. adults who say they are paying close attention to local news has dropped since our last major survey of attitudes toward local news in 2018, mirroring declining attention to national news.
  • Americans still see value in local news and local journalists. A large majority say local news outlets are at least somewhat important to the well-being of their local community. Most people also say local journalists are in touch with their communities and that their local news media perform well at several aspects of their jobs, such as reporting the news accurately.
  • At the same time, a relatively small share of Americans (15%) say they have paid for local news in the last year. And many seem unaware of the major financial challenges facing local news: A 63% majority (albeit a smaller majority than in 2018) say they think their local news outlets are doing very or somewhat well financially.
  • Majorities of both major parties say local media in their area are doing their jobs well. While Republicans and GOP-leaning independents are slightly less positive than Democrats and Democratic leaners in their opinions of local media, views of local news don’t have the same stark political divides that exist within Americans’ opinions about national media .
  • Most Americans say local journalists should remain neutral on issues in their community, but a substantial minority say local journalists should take a more active role. About three-in-ten say local journalists should advocate for change in their communities, a view that’s especially common among Democrats and younger adults.

These are some of the key findings from a new Pew Research Center survey of about 5,000 U.S. adults conducted in January 2024. This is the first in a series of Pew Research Center reports on local news from the Pew-Knight Initiative, a research program funded jointly by The Pew Charitable Trusts and the John S. and James L. Knight Foundation.

Americans largely hold positive views of local news organizations

At a time when many local news outlets are struggling and Americans’ trust in the news media has waned, the vast majority of U.S. adults (85%) say local news outlets are at least somewhat important to the well-being of their local community. This includes 44% who say local journalism is extremely or very important to their community

About seven-in-ten U.S. adults (69%) say that local journalists in their area are mostly in touch with their community, up from 63% who said this in 2018. And most Americans also say their local news organizations are doing well at four key roles:

A bar chart showing most Americans say local media are doing well at different aspects of reporting

  • Reporting news accurately (71%)
  • Covering the most important stories (68%)
  • Being transparent (63%)
  • Keeping an eye on local political leaders (61%).

These are relatively positive views compared with how Americans see news organizations more broadly. For instance, a 2022 Pew Research Center survey found that fewer than half of U.S. adults say that news organizations in general do a very or somewhat good job of covering the most important stories, reporting the news accurately and serving as a watchdog over elected leaders.

A bar chart showing majorities of both political parties believe their local news media do various aspects of their jobs well

What’s more, views toward local news are not as politically polarized as Americans’ opinions about the news media overall. While Republicans and GOP-leaning independents are not quite as positive as Democrats and Democratic leaners in some of their assessments of local journalists, most Republicans still say the local media in their area are doing their jobs well.

For example, roughly three-quarters of Democrats (78%) say their local media do well at reporting news accurately, compared with about two-thirds of Republicans (66%).

By comparison, the 2022 survey found that 51% of Democrats and just 17% of Republicans say that news organizations in general do a very or somewhat good job of reporting the news accurately.

Jump to more information on views toward local news organizations.

A bar chart showing declines in attention to both local and national news

Fewer Americans are closely following local news – and other types of news

Despite these positive views toward local news organizations, there are signs that Americans are engaging less with local journalism than they used to.

The share of Americans who say they follow local news very closely has fallen by 15 percentage points since 2016 (from 37% to 22%). Most U.S. adults still say they follow local news at least somewhat closely (66%), but this figure also has dropped in recent years.

A line chart showing Americans’ preferred path to local news is moving online

This trend is not unique to local news – Americans’ attention to national and international news also has declined.

The local news landscape is becoming more digital

The ways in which Americans access local news are changing, reflecting an increasingly digital landscape – and matching patterns in overall news consumption habits .

Preferred pathways to local news

  • Fewer people now say they prefer to get local news through a television set (32%, down from 41% who said the same in 2018).
  • Americans are now more likely to say they prefer to get local news online, either through news websites (26%) or social media (23%). Both of these numbers have increased in recent years.
  • Smaller shares prefer getting their local news from a print newspaper or on the radio (9% each).

Specific sources for local news

The types of sources (e.g., outlets or organizations) Americans are turning to are changing as well:

A bar chart showing more Americans get local news from online forums than daily newspapers

  • While local television stations are still the most common source of local news beyond friends, family and neighbors, the share who often or sometimes get news there has declined from 70% to 64% in recent years.
  • Online forums, such as Facebook groups or the Nextdoor app, have become a more common destination for local news: 52% of U.S. adults say they at least sometimes get local news from these types of forums, up 14 percentage points from 2018. This is on par with the percentage who get local news at least sometimes from local radio stations.
  • Meanwhile, a third of Americans say they at least sometimes get local news from a daily newspaper, regardless of whether it is accessed via print, online or through a social media website – down 10 points from 2018. The share of Americans who get local news from newspapers is now roughly on par with the share who get local news from local government agencies (35%) or local newsletters or Listservs (31%).

Not only are fewer Americans getting local news from newspapers, but local daily newspapers are now more likely to be accessed online than in print.

A bar chart showing local newspapers are no longer accessed primarily through print

  • 31% of those who get news from daily newspapers do so via print, while far more (66%) do so digitally, whether through websites, apps, emails or social media posts that include content from the paper.
  • In 2018, just over half of those who got news from local daily newspapers (54%) did so from print, and 43% did so via a website, app, email or social media site.

There is a similar move toward digital access for local TV stations, though local TV news is still mostly consumed through a TV set.

  • In 2024, 62% of those getting news from local TV stations do so through a television, compared with 37% who do so through one of the digital pathways.
  • An even bigger majority of local TV news consumers (76%) got that news through a TV set in 2018.

Jump to more information on how people access local news.

The financial state of local news

The turmoil for the local news industry in recent years has come with major financial challenges. Circulation and advertising revenue for newspapers have seen sharp declines in the last decade, according to our analysis of industry data , and other researchers have documented that thousands of newspapers have stopped publishing in the last two decades. There also is evidence of audience decline for local TV news stations, although advertising revenue on local TV has been more stable.

A bar chart showing the share who think their local news is doing well financially has fallen since 2018 but is still a majority

When asked about the financial state of the news outlets in their community, a majority of Americans (63%) say they think their local news outlets are doing very or somewhat well, with a third saying that they’re not doing too well or not doing well at all. This is a slightly more pessimistic view than in 2018, when 71% said their local outlets were doing well, though it is still a relatively positive assessment of the financial state of the industry.

Just 15% of Americans say they have paid or given money to any local news source in the past year – a number that has not changed much since 2018. The survey also asked Americans who did not pay for news in the past year the main reason why not. The most common explanation is that people don’t pay because they can find plenty of free local news, although young adults are more inclined to say they just aren’t interested enough in local news to pay for it.

Jump to more information on how people view the financial state of local news.

Other key findings in this report

A bar chart showing weather, crime, traffic and government are all commonly followed local news topics

Americans get local news about a wide variety of topics. Two-thirds or more of U.S. adults at least sometimes get news about local weather, crime, government and politics, and traffic and transportation, while smaller shares (but still at least half) say they get local news about arts and culture, the economy, schools, and sports.

Relatively few Americans are highly satisfied with the coverage they see of many topics. The survey also asked respondents who at least sometimes get each type of local news how satisfied they are with the news they get. With the exception of weather, fewer than half say they are extremely or very satisfied with the quality of the news they get about each topic. For example, about a quarter of those who consume news about their local economy (26%) say they are extremely or very satisfied with this news. Read more about different local news topics in Chapter 2.

A bar chart showing younger adults are more likely to say that local journalists should advocate for change in the community

When asked whether local journalists should remain neutral on community issues or advocate for change in the community, a majority of Americans (69%) say journalists should remain neutral, reflecting more traditional journalistic norms. However, 29% say that local journalists should be advocating for change in their communities. Younger adults are the most likely to favor advocacy by journalists: 39% of those ages 18 to 29 say that local journalists should push for change, as do 34% of those 30 to 49. Read more about Americans’ views of the role of local journalists in Chapter 4.

Americans who feel a strong sense of connection to their community are more likely to engage with local news, say that local news outlets are important to the community, and rate local media more highly overall. For example, 66% of those who say they are very attached to their community say local news outlets are extremely or very important to the well-being of their local community, compared with 46% of those who are somewhat attached and 31% of those who are not very or not at all attached to their community.

Sign up for our weekly newsletter

Fresh data delivery Saturday mornings

Sign up for The Briefing

Weekly updates on the world of news & information

  • Digital News Landscape
  • Journalists
  • Trust in Media

Introducing the Pew-Knight Initiative

8 facts about black americans and the news, audiences are declining for traditional news media in the u.s. – with some exceptions, how black americans engage with local news, local tv news fact sheet, most popular, report materials.

1615 L St. NW, Suite 800 Washington, DC 20036 USA (+1) 202-419-4300 | Main (+1) 202-857-8562 | Fax (+1) 202-419-4372 |  Media Inquiries

Research Topics

  • Age & Generations
  • Coronavirus (COVID-19)
  • Economy & Work
  • Family & Relationships
  • Gender & LGBTQ
  • Immigration & Migration
  • International Affairs
  • Internet & Technology
  • Methodological Research
  • News Habits & Media
  • Non-U.S. Governments
  • Other Topics
  • Politics & Policy
  • Race & Ethnicity
  • Email Newsletters

ABOUT PEW RESEARCH CENTER  Pew Research Center is a nonpartisan fact tank that informs the public about the issues, attitudes and trends shaping the world. It conducts public opinion polling, demographic research, media content analysis and other empirical social science research. Pew Research Center does not take policy positions. It is a subsidiary of  The Pew Charitable Trusts .

Copyright 2024 Pew Research Center

Here’s why so many Republicans won’t buy EVs

Democrats say they are way more likely than Republicans to buy electric cars. Could that change?

social media sentiment analysis research paper

Electric cars have taken off across the United States. Even amid news of slowing sales , the country sold almost 1.2 million fully electric vehicles in 2023, more than quadruple the number in 2019. Grocery stores and rest stops are installing charging stations across the country; electric cars have moved beyond niche status and are being produced by Ford, GM, Hyundai and many others.

But there is one thing holding the nation back from the dream of an all-electric future: political polarization. Sales data have consistently shown that while Democrats have been buying the new cars in droves, Republicans haven’t jumped onto the EV-buying train.

“The Republican is like, ‘They’re trying to ban gas cars — I’m not going to buy a Biden-mobile,’” said Mike Murphy, a former Republican strategist who runs the nonprofit EV Politics Project, which attempts to counter misinformation on electric cars and encourage conservatives to adopt the vehicles.

Personal cars account for 20 percent of U.S. planet-warming emissions, and more Americans still prefer gas-powered ones. A Washington Post-University of Maryland poll last year found that 46 percent of respondents favored a gas car, compared with 19 percent who wanted a fully electric vehicle. If that doesn’t change, it will be almost impossible for the United States to meet its climate goals.

According to a Gallup poll conducted in March of this year, 61 percent of Democrats reported that they were “seriously considering” or “might consider” buying an EV in the future — compared with only 24 percent of Republicans. At the same time, 69 percent of Republicans said that they “would not buy” an EV in future, compared with 27 percent of Democrats. The difference in Democratic and Republican respondents who owned an EV was within the margin of error.

Actual sales show a partisan trend. According to an analysis from researchers at the University of California at Berkeley, MIT and HEC Montréal, between 2012 and 2022, about half of all EVs sold went to the top 10 percent most Democratic counties in the United States, and about a third of all EVs sold went to the top 5 percent most Democratic counties. That pattern persisted when researchers analyzed the most Democratic states, according to the working paper, which has not yet been peer-reviewed.

The finding held when researchers accounted for income, gas prices and population density. That means that even when looking at dense, urban areas — which are likely to have more public EV charging — Democratic counties outweighed Republican ones in EV adoption.

“There’s an incredible correlation with political ideology,” said Lucas Davis, a professor of business and technology at the University of California at Berkeley and the lead author on the analysis.

During the 10 years of the study, Davis said, the electric vehicle market swelled from just a few all-electric models to dozens, with annual sales growing from around 50,000 EVs to 750,000. But the proportion of those vehicles going to Democratic counties stayed nearly constant. “It’s incredibly consistent across time,” Davis said.

There are other possible explanations, he said: For example, buyers in counties with a higher EV share could notice electric cars more, and ultimately consider buying one themselves.

Still, Democratic areas have much more than their fair share of electric cars. So why did electric cars become — and stay — so polarized?

One reason is that Republican leaders have injected electric vehicles into the culture wars, in light of President Biden’s effort to move the country away from gas cars. Former president Donald Trump has railed against electric cars, calling support of them “electric car lunacy” and the push for an EV future “very, very stupid.”

“I think it is getting more polarized,” Murphy said. “Republicans are instinctively: ‘If Biden’s for it, we’re against.’”

An analysis of social media content by The Post shows that conservative criticism of electric cars rose in August 2022 after California announced rules to phase out fully gas-powered cars by 2035 — and when the Biden administration finalized an Environmental Protection Agency rule in March that would push automakers to make more than 50 percent of cars sold by 2032 all-electric or hybrid.

Conservatives talking about EVs often make a few standard critiques: that switching to electric cars will end up supporting China, that the vehicles are too expensive for ordinary Americans, or that Democrats are taking away Americans’ right to choose which cars they drive.

“It’s a religion,” Tucker Carlson said on Fox News shortly after California announced its plan to phase out gas-powered cars. “It’s about making them feel like good people and increasing their control over you.”

But there are other reasons Republicans might be more reluctant to switch to electric cars. Marc Hetherington, a political science professor at the University of North Carolina at Chapel Hill and a co-author of the 2018 book “Prius or Pickup?” said that political polarization partly stems from deep-seated psychological “worldviews.” Liberals tend to be more afraid of systemic issues like climate change compared to conservatives. Conservatives also tend to be more hesitant to adopt new technologies; liberals are more open to change.

Those preferences can lead Republicans to stick with the large cars and trucks that they know; it can also lead liberals to opt for hybrids or electric vehicles.

When it comes to Democrats adopting EVs, Davis said, “It demonstrates your environmental bona fides to your neighbors.”

And once those habits are created, they become calcified. Cars become not just a way to get around but a form of personal expression, identity and group membership. Democrats see friends and family driving EVs and want to do the same; Republicans hear elite members of their party blasting electric cars and opt to stay with gas.

Some analysts hope to change how conservatives see electric cars. John Marshall, the founder and CEO of the Potential Energy Coalition , a group that uses marketing strategies to boost climate action, says that his group’s research shows that abstract messaging — around EVs as job creators or as a path to energy independence — doesn’t work as well as pointing out how EVs cut pollution and the cost of owning a car. (According to an analysis by the group Energy Innovation, EVs can save $14 to $80 per refuel.)

Talk of banning gas-powered cars also doesn’t help. “Mandates, bans and limitations never win,” Marshall said. “Whenever one of those three things is mentioned, you lose support pretty significantly.”

Murphy said carmakers need to focus primarily on how they can help consumers. “They’re fast, they’re quiet, need much less maintenance,” he said.

Gil Tal, the director of the Electric Vehicle Research Center at the University of California at Davis, says that resistance to new technologies is a normal part of the process. But he worries that this partisan divide could slow the process. “We need long-term investment and commitment in order to complete this push,” he said. “It’s not going to happen by itself.”

About 40 percent of new car buyers are Republicans, Murphy said — without them, the country won’t be able to shift fully to electric vehicles.

One of the big questions going forward is how much of the trend is due to deep-seated disagreement on electric cars — and how much is simply the effect of seeing friends and neighbors owning the vehicles. Multiple studies have shown that actions like buying solar panels or buying EVs are “contagious”; they spread among communities.

That means that if EVs gain a foothold in Republican areas, they could spread quickly.

“If you know someone who has one, you’re far more interested and pro-EV than not,” Murphy said, “because the real-life experience contradicts the myth.”

social media sentiment analysis research paper

  • Share full article

Advertisement

Supported by

Trump Media Stock Rises Again, Adding Billions to Former President’s Stake

In their first few weeks of trading, shares of Truth Social’s parent company soared, then crashed, and have recently made a strong recovery. Former President Donald J. Trump’s holding is now worth about $6 billion.

social media sentiment analysis research paper

By Matthew Goldstein

Shares of former President Donald J. Trump’s social media company have been on a wild ride since making their debut on Wall Street in March: soaring, crashing and then climbing again.

The rally has pushed the value of Mr. Trump’s majority stake in the company to some $6 billion, a major windfall as he ramps up his presidential campaign and faces steep legal bills tied to the multiple cases against him.

On Wednesday, the share price of Trump Media & Technology Group closed around $53, approaching where the parent company of Truth Social ended its first day of trading on March 26, at just under $58.

Mr. Trump owns nearly two-thirds of the company, but he is not allowed to sell his shares or use them as collateral for a few months. Trump Media’s multibillion-dollar market valuation has put it in the same league as established companies like American Airlines and Hasbro.

The frenzied trading in Trump Media has been reminiscent of the mania around so-called meme stocks — shares that trade based on investor sentiment and momentum more than traditional financial fundamentals.

The company last year lost $58 million and took in just $4 million in revenue, all of it from advertising on Truth Social.

“A meme stock has no economic rationale,” said Mike Stegemoller, a finance professor at Baylor University. “It is like someone acting erratically on the street — just get out the way.”

In classic meme-stock fashion, there have been no major developments at Trump Media to propel the shares higher of late. But the stock trades in relation to how investors view Mr. Trump’s prospects in the upcoming presidential election, their fealty to the former president, and simply momentum and emotion.

The recent rise in Trump Media’s share price has also provided a sharp contrast to the scenes in a New York courtroom, where Mr. Trump is on trial for his role in a scheme to conceal hush-money payments to Stormy Daniels, an adult film star, in the final days of the 2016 presidential campaign.

Trump Media merged with a public shell company for its market listing, and more than 400,000 of the shareholders of that company are individual investors. Many of them are fans of Mr. Trump and users of Truth Social.

Digital World Acquisition Corporation, the shell company, began the year trading around $17.50 a share. That means it has been a banner year for early investors in Digital World who are now shareholders of Trump Media.

Michael Melkersen, a lawyer who lives in Puerto Rico, invested in Digital World before its initial public offering in 2021 and now has hundreds of thousands of shares of Trump Media. He said he had taken a gamble that paid off and was now looking to sell some shares.

“I believed in it,” he said, “but I am too heavily invested for my own comfort.”

Recent regulatory filings show that just a handful of investment advisory firms, which manage money for wealthy investors, held a meaningful amount of Trump Media shares at the end of March.

Sapient Capital, which manages $9 billion in assets for wealthy people and foundations, reported holding about 145,000 shares of Trump Media. A founding partner at the Indianpolis-based firm is a brother of Mike Pence, who served as vice president for Mr. Trump. The former vice president said in March that he would not endorse Mr. Trump this year. A Sapient spokesman said the shares were purchased at the request of its clients.

The filings also show that a few investment firms had significant positions in Trump Media warrants — a type of security that can be converted into shares in the future. The firms included Selway Asset Management, Magnus Financial Group and Nine Masts Capital, according to the regulatory tracking service Whalewisdom.

Justin Yochum, a co-owner of Selway, an Idaho-based investment advisory firm, said the firm had bought the warrants as part of a complex trading strategy to profit on the price difference between Trump Media’s shares and its warrants.

“I believe the company itself is worth zero,” Mr. Yochum said.

A few hedge funds reported holding “put” options on the stock. These allow investors to sell shares at a future date at a specified price, and are often bought by traders who think the price of a stock will fall.

A recent analysis of the ownership of Trump Media shares by S&P Global Market Intelligence found that Mr. Trump, corporate insiders and an entity closely affiliated with Trump Media owned nearly 96 percent of the social media company’s stock. Retail investors owned about 3.9 percent of Trump Media’s stock, or roughly 5.4 million shares, and institutional investors owned less than 1 percent.

Trump Media has been pushing back against short-sellers, as investors who bet on a stock’s fall are known, sending letters to stock exchanges and lawmakers asking for investigations into what it describes as “manipulation.”

S3 Partners, a market research firm, estimates that short sellers have incurred losses of $216 million on paper this year by betting against Trump Media.

As it happens, short-sellers have also recently been squeezed by a resurgence of GameStop, the original meme stock. Those losses were even greater, at more than $800 million, according to S3.

The stock roared back to life this week after the apparent return to social media of Keith Gill, a former broker who goes by the moniker “Roaring Kitty,” after a three-year hiatus. Mr. Gill was at the center of the initial meme-stock mania in 2021, when he encouraged investors to buy shares of GameStop, which he argued was undervalued.

GameStop’s stock has roughly doubled since Monday, on little news other than a series of cryptic posts on Mr. Gill’s X account.

Matthew Goldstein covers Wall Street and white-collar crime and housing issues. More about Matthew Goldstein

Survey on sentiment analysis: evolution of research methods and topics

  • Published: 06 January 2023
  • Volume 56 , pages 8469–8510, ( 2023 )

Cite this article

social media sentiment analysis research paper

  • Jingfeng Cui   ORCID: orcid.org/0000-0001-8306-0727 1 , 2 ,
  • Zhaoxia Wang   ORCID: orcid.org/0000-0001-7674-5488 3 ,
  • Seng-Beng Ho   ORCID: orcid.org/0000-0003-4839-1509 1 &
  • Erik Cambria   ORCID: orcid.org/0000-0002-3030-1280 4  

14k Accesses

34 Citations

1 Altmetric

Explore all metrics

Sentiment analysis, one of the research hotspots in the natural language processing field, has attracted the attention of researchers, and research papers on the field are increasingly published. Many literature reviews on sentiment analysis involving techniques, methods, and applications have been produced using different survey methodologies and tools, but there has not been a survey dedicated to the evolution of research methods and topics of sentiment analysis. There have also been few survey works leveraging keyword co-occurrence on sentiment analysis. Therefore, this study presents a survey of sentiment analysis focusing on the evolution of research methods and topics. It incorporates keyword co-occurrence analysis with a community detection algorithm. This survey not only compares and analyzes the connections between research methods and topics over the past two decades but also uncovers the hotspots and trends over time, thus providing guidance for researchers. Furthermore, this paper presents broad practical insights into the methods and topics of sentiment analysis, while also identifying technical directions, limitations, and future work.

Similar content being viewed by others

social media sentiment analysis research paper

A survey on sentiment analysis methods, applications, and challenges

social media sentiment analysis research paper

A review on sentiment analysis and emotion detection from text

social media sentiment analysis research paper

Sentiment Analysis in the Age of Generative AI

Avoid common mistakes on your manuscript.

1 Introduction

Web 2.0 has driven the proliferation of user-generated content on the Internet. This content is closely related to the lives, emotions, and opinions of users. Therefore, analysis of this user-generated data is beneficial for monitoring public opinion and assisting in making decisions. Sentiment analysis, as one of the most popular applications of text-based analytics, can be used to mine people’s attitudes, emotions, appraisals, and opinions about issues, entities, topics, events, and products (Cambria et al. 2022a , b , c , d ; Injadat et al. 2016 ; Jiang et al. 2017 ; Liang et al. 2022 ; Oueslati et al. 2020 ; Piryani et al. 2017 ). Sentiment analysis can help us interpret emotions in unstructured texts as positive, negative, or neutral, and even calculate how strong or weak the emotions are. Today, sentiment analysis is widely used in various fields, such as business, finance, politics, education, and services. This analytical technique has gained broad acceptance not only among researchers but also among governments, institutions, and companies (Khatua et al. 2020 ; Liu et al. 2012 ; Sánchez-Rada and Iglesias 2019 ; Wang et al. 2020b ). It helps policy leaders, businessmen, and service people make better decisions.

The majority of user-generated content data is unstructured text, which increases the great difficulty of sentiment analysis. Since 2000, researchers have been exploring techniques and methods to enhance the accuracy of such analysis. The popularity of social media platforms has brought people around the world closer together. With the continuous advancement of technology, the research topics, application fields, and core methods and technologies of sentiment analysis are also constantly changing.

Comparing and analyzing papers from specific disciplines can help researchers gain a comprehensive understanding of the field. There have been many surveys on sentiment analysis (Nair et al. 2019 ; Obiedat et al. 2021 ; Raghuvanshi and Patil 2016 ). However, there is a lack of adequate discussion on the connections between research methods and topics in the field, as well as on their evolution over time. In 1983, Callon et al. proposed co-word analysis (Callon et al. 1983 ). It can effectively reflect the correlation strength of information items in text data. Co-word analysis based on the frequency of co-occurrence of keywords used to describe papers can reveal the core contents of the research in specific fields. An evolutionary analysis of the associations between core contents is helpful for a comprehensive understanding of the research hotspots and frontiers in the field (Deng et al. 2021 ). It can provide guidance for researchers, especially those who are new to the field, and help them determine research directions, avoid repetitive research, and better discover and grasp the research trends in this field (Wang et al. 2012 ). To fill in the gap in existing research, we conduct keyword co-occurrence analysis and evolution analysis with informetric tools to explore the research hotspots and trends of sentiment analysis.

The main contributions of this survey are as follows:

Using keyword co-occurrence analysis and the informetric tools, the paper presents a survey on sentiment analysis, explores and discovers useful information.

A keyword co-occurrence network is constructed by combining the paper title, abstract, and author keywords. Through the keyword co-occurrence network and community detection algorithm, the research methods and topics in the field of sentiment analysis, along with their evolution in the past two decades, are discussed.

The paper summarizes the research hotspots and trends in sentiment analysis. It also highlights practical implications and technical directions.

The remainder of this paper is organized as follows: In Sect.  2 , we summarize and analyze the existing surveys on sentiment analysis and present the research purpose and methodologies of this paper. Section  3 details the survey methodology, including the collection and processing of scientific publications, visualization, and analysis using different methods and tools. In Sect.  4 , we analyze the results obtained from the keyword co-occurrence analysis and evolution analysis, along with the research hotspots and trends in sentiment analysis identified through the analysis results. Finally, in Sect.  5 , we summarize the research conclusions as well as the practical implications and technical directions of sentiment analysis. We also clarify the limitations of this paper and make suggestions for future work.

2 Existing surveys on sentiment analysis

Sentiment analysis is a concept encompassing many tasks, such as sentiment extraction, sentiment classification, opinion summarization, review analysis, sarcasm detection or emotion detection, etc. Since the 2000s, sentiment analysis has become a popular research field in natural language processing (Hussein 2018 ). In the existing surveys, the researchers mainly conducted specific analyses of the tasks, technologies, methods, analysis granularity, and application fields involved in the sentiment analysis process.

2.1 Surveys on contents and topics of sentiment analysis

When research on sentiment analysis was still in its infancy, the contents and topics of surveys mainly focused on sentiment analysis tasks, analysis granularity, and application areas. Kumer et al. reviewed the basic terms, tasks, and levels of granularity related to sentiment analysis (Kumar and Sebastian 2012 ). They also discussed some key feature selection techniques and the applications of sentiment analysis in business, politics, recommender systems and other fields. Nassirtoussi et al. explored the application of sentiment analysis in market prediction (Nassirtoussi et al. 2014 ). Medhat et al. analyzed the improvement of the algorithms proposed in 2010–2013 and their application fields (Medhat et al. 2014 ). Ravi et al. analyzed the papers related to opinion mining and sentiment analysis from 2002 to 2015. Their study mainly discussed the necessary tasks, methods, applications, and unsolved problems in the field of sentiment analysis (Ravi and Ravi 2015 ).

Existing surveys of the applications of sentiment analysis have focused more on the domains of market research, medicine, and social media in recent years. Rambocas et al. examined the application of sentiment analysis in marketing research from three main perspectives, including the unit of analysis, sampling design, and methods used in sentiment detection and statistical analysis (Rambocas and Pacheco 2018 ). Cheng et al. summarized techniques based on semantic, sentiment, and event extraction, as well as hybrid methods employed in stock forecasting (Cheng et al. 2022 ). Yue et al. categorized and compared a large number of techniques and approaches in the social media domain. That study also introduced different types of data and advanced research tools, and discussed their limitations (Yue et al. 2019 ). In the context of the COVID-19 epidemic, Alamoodi et al. reviewed and analyzed articles on the occurrence of different types of infectious diseases in the past 10 years. They reviewed the applications of sentiment analysis from the identified 28 articles, summarizing the adopted techniques such as dictionary-based models, machine learning models, and mixed models (Alamoodi et al. 2021b ); Alamoodi et al. also conducted a review of the applications of sentiment analysis for vaccine hesitancy (Alamoodi et al. 2021a ). Researchers also reviewed the application of sentiment analysis in the fields of election prediction (Brito et al. 2021 ), education (Kastrati et al. 2021 ; Zhou and Ye 2020 ) and service industries (Adak et al. 2022 ).

Quite a number of research works investigated sentiment analysis works in non-English languages. Sentiment analysis in Chinese (Peng et al. 2017 ), Arabic (Al-Ayyoub et al. 2019 ; Boudad et al. 2018 ; Nassif et al. 2021 ; Oueslati et al. 2020 ), Urdu (Khattak et al. 2021 ), Spanish (Angel et al. 2021 ), and Portuguese (Pereira 2021 ) were conducted. They mainly reviewed the classification frameworks of the sentiment analysis process, supported language resources (dictionaries, natural language processing tools, corpora, ontologies, etc.), and deep learning models used (CNN, RNN, and transfer learning) for each of the languages involved.

2.2 Surveys on methods of sentiment analysis

Before machine learning technology became mature, researchers were particularly concerned about feature extraction methods. For example, Feldman summarized methods for extracting preferred entities from indirect opinions and methods for dictionary acquisition (Feldman 2013 ). Asghar et al. reviewed the natural language processing techniques for extracting features based on part of speech and term position; statistical techniques for extracting features based on word frequency and decision tree model; and techniques for combining part of speech tagging, syntactic feature analysis, and dictionaries (Asghar et al. 2014 ). Koto et al. discussed the best features for Twitter sentiment analysis prior to 2014 by comparing 9 feature sets (Koto and Adriani 2015 ). They found that the current best features for sentiment analysis of Twitter texts are AFINN (a list of English terms used for sentiment analysis manually rated by Finn Årup Nielsen) (Nielsen 2011 ) and Senti-Strength (Thelwall et al. 2012 ). Taboada sorted out the characteristics of words, phrases, and sentence patterns in sentiment analysis from the perspective of linguistics (Taboada 2016 ). Besides, Schouten and Frasinar conducted a comprehensive and in-depth critical evaluation of 15 sentiment analysis web tools (Schouten and Frasincar 2015 ). Medhat et al. ( 2014 ) and Ravi et al. (Ravi and Ravi 2015 ) also analyzed the early algorithms for sentiment analysis.

In the study by Schouten et al., the authors focused on aspect-level sentiment analysis, combing the techniques of aspect-level sentiment analysis before 2014, such as frequency-based, syntax-based, supervised machine learning, unsupervised machine learning, and hybrid approaches. They concluded that the latest technology was moving beyond the early stages (Schouten and Frasincar 2015 ). As research into sentiment analysis became more and more popular and there was important progress made in the development of deep learning technologies, researchers started to pay more attention to the techniques and methods of sentiment analysis. Deep learning methods in particular became the focus of discussions among researchers.

Prabha et al. analyzed various deep learning methods used in different applications at the level of sentence and aspect/object sentiment analysis, including Convolutional Neural Network (CNN), Recurrent Neural Network (RNN), and Long Short-term Memory (LSTM) (Prabha and Srikanth 2019 ). They discussed the advantages and disadvantages of these methods and their performance parameters. Ain et al. introduced deep learning techniques such as Deep Neural Network (DNN), CNN and Deep Belief Network (DBN) to solve sentiment analysis tasks like sentiment classification, cross-lingual problems, and product review analysis (Ain et al. 2017 ). Zhang et al. investigated deep learning and machine learning techniques for sentiment analysis in the contexts of aspect extraction and categorization, opinion expression extraction, opinion holder extraction, sarcasm analysis, multimodal data, etc. (Zhang et al. 2018 ). Habimana et al. compared the performance of deep learning methods on specific datasets and proposed that performance could be improved using models including Bidirectional Encoder Representations from Transformers (BERT), sentiment-specific word embedding models, cognitive-based attention models, and commonsense knowledge (Habimana et al. 2020 ). Wang et al. reviewed and discussed existing analytical models for sentiment classification and proposed a computational emotion-sensing model (Wang et al. 2020b ).

Some researchers also discussed web tools (Zucco et al. 2020 ), fuzzy logic algorithms (Serrano-Guerrero et al. 2021 ), transformer models (Acheampong et al. 2021 ), and sequential transfer learning (Chan et al. 2022 ) for sentiment analysis.

2.3 Overall survey methodology

With the increase in the popularity of sentiment analysis research, more related research results began to accumulate. Researchers needed to systematically organize and analyze results from a large number of publications to perform literature reviews. They used different survey methodologies to conduct surveys of a large number of papers.

Content analysis is a powerful approach to characterizing the contents of each study by carefully reading its content and manually identifying, coding, and organizing key information in it. A literature review is formed as a result of the repeated use of this approach (Elo and Kyngäs 2008 ; Stemler 2000 ). Content analysis has been used for different studies and systematic reviews (Qazi et al. 2015 , 2017 ). For example, Birjali et al. have studied the most commonly used classification techniques in sentiment analysis from a large amount of literature and introduced the application areas and sentiment classification processes, including preprocessing and feature selection (Birjali et al. 2021 ). They conducted a comprehensive analysis of the papers, discovering that supervised machine learning algorithms are the most commonly used techniques in the field. A complete review of methods and evaluation for sentiment analysis tasks and their applications was conducted by Wankhade et al. ( 2022 ). They compared the strengths and weaknesses of the methods, and discussed the future challenges of sentiment analysis in terms of both the methods and the forms of the data. Although this method can review the research contents and penetrate into the cores of the papers most systematically, it requires a considerable amount of manpower and time for in-depth literature reading.

The systematic literature review guideline proposed by Kitchenham and Charters has gradually attracted the attention of researchers (Kitchenham 2004 ; Kitchenham and Charters 2007 ; Sarsam et al. 2020 ). This review process is divided into six stages: research question definition, search strategy formulation, inclusion and exclusion criteria definition, quality assessment, data extraction, and data synthesis. Researchers can eliminate a large number of retrieved papers by using this standard process and finally conducting further analysis and research on a small number of papers. Kumar et al. reviewed context-based sentiment analysis in social multimedia between 2006 and 2018. From the 573 papers retrieved in the initial search, they finally selected 37 papers to use in discussing sentiment analysis techniques (Kumar and Garg 2020 ). This approach was also used by Kumar et al. in their research on sentiment analysis on Twitter using soft computing techniques. They selected 60 articles out of 502 for follow-up analysis (Kumar and Jaiswal 2020 ). Zunic et al. selected 86 papers from 299 papers retrieved in the period 2011–2019 to discuss the application of sentiment analysis techniques in the field of health and well-being (Zunic et al. 2020 ); Ligthart et al. followed Kitchenham’s guideline and identified 14 secondary studies. They provided an overview of specific sentiment analysis tasks and of the features and methods required for different tasks (Ligthart et al. 2021 ). Obiedat (Obiedat et al. 2021 ), Angel (Angel et al. 2021 ) and Lin (Lin et al. 2022 ) also all followed this guideline to select literature for further analysis. This method can reduce the amount of literature that requires in-depth reading, but in the case of a large amount of literature, more effort is still required to search and screen the material than in traditional literature review methods (Kitchenham and Charters 2007 ).

There are also a few authors who have used informetric methods to review papers. Piryani et al. conducted an informetric analysis of research on opinion mining and sentiment analysis from 2000 to 2015 (Piryani et al. 2017 ). The authors used social network analysis, literature co-citation analysis, and other methods in the paper. They analyzed publication growth rates; the most productive countries, institutions, journals, and authors; and topic density maps and keyword bursts, among other elements. To a certain extent, they interpreted core authors, core papers, areas of research focus in this field, and the current state of national cooperation. In order to explore the application of sentiment analysis in building smart societies, Verma collected 353 papers published between 2010 and 2021 (Verma 2022 ). Using a topic analysis perspective combined with the Louvain algorithm, the author identified four sub-topics in the research field. Similarly, Mantyla et al. employed LDA techniques and manual classification to explore the topic structures of sentiment analysis articles (Mäntylä et al. 2018 ). The informetric methods use natural language processing technologies to intuitively conduct topic mining and analysis of a large number of papers. Through topic clustering, the literature is organized and analyzed, which reduces the time researchers spend on reading the literature in depth. These methods are suitable for exploring research topics and trends in the field.

2.4 Summary of advantages and disadvantages of the existing surveys

In the following, we discuss the advantages and disadvantages of the existing surveys from a number of different points of view.

2.4.1 From the point of view of the contents and topics of sentiment analysis

As summarized in Table 1 , the researchers organized the literature and conducted depth investigations of the contents and topics of sentiment analysis. They reviewed the tasks of sentiment analysis (e.g., different text granularity, opinion mining, spam review detection, and emotion detection), the application areas of sentiment analysis (e.g. market, medicine, social media, and election prediction), and different languages for sentiment analysis, such as Chinese, Spanish, and Arabic (Adak et al. 2022 ; Al-Ayyoub et al. 2019 ; Alamoodi et al. ( 2021a , b ); Alonso et al. 2021 ; Angel et al. 2021 ; Boudad et al. 2018 ; Brito et al. 2021 ; Cheng et al. 2022 ; Hussain et al. 2019 ; Kastrati et al. 2021 ; Khattak et al. 2021 ; Koto and Adriani 2015 ; Kumar and Sebastian 2012 ; Ligthart et al. 2021 ; Medhat et al. 2014 ; Nassif et al. 2021 ; Nassirtoussi et al. 2014 ; Oueslati et al. 2020 ; Peng et al. 2017 ; Pereira 2021 ; Rambocas and Pacheco 2018 ; Ravi and Ravi 2015 ; Schouten and Frasincar 2015 ; Sharma and Jain 2020 ; Yue et al. 2019 ; Zhou and Ye 2020 ). They summarized the methods and application prospects of sentiment analysis under different contents and topics. As the field has grown, new topics have emerged, and knowledge from other fields has been gradually integrated into it. In recent years, the popularity of social media has aroused increasing interest in sentiment analysis research, and the number of papers published, especially those related to different topics of sentiment analysis, has grown rapidly. However, the existing surveys cover a short time range, and there has not been a survey dedicated to the evolution of research contents or topics of sentiment analysis. There have also been few survey works analyzing the connections between topics and methods, or their evolution (e.g., how the contents and topics of sentiment analysis have changed over time).

2.4.2 From the point of view of the methods of sentiment analysis

Some researchers reviewed different techniques and methods of sentiment analysis in different application areas and tasks. They analyzed and discussed sentiment analysis methods based on lexicons, rules, part of speech, term position, statistical techniques, supervised and unsupervised machine learning methods, as well as deep learning methods like LSTM, CNN, RNN, DNN, DBN, BERT, and other hybrid approaches (Acheampong et al. 2021 ; Ain et al. 2017 ; Alamoodi et al. 2021b ; Asghar et al. 2014 ; Chan et al. 2022 ; Cheng et al. 2022 ; Feldman 2013 ; Habimana et al. 2020 ; Koto and Adriani 2015 ; Kumar, Akshi and Sebastian 2012 ; Medhat et al. 2014 ; Prabha and Srikanth 2019 ; Ravi and Ravi 2015 ; Schouten and Frasincar 2015 ; Serrano-Guerrero et al. 2021 ; Taboada 2016 ; Wang et al. 2020b ; Yue et al. 2019 ; Zhang et al. 2018 ; Zucco et al. 2020 ). These researchers also compared the advantages and disadvantages of each method. As summarized in Table 1 , even though existing surveys analyze the techniques and methods of sentiment analysis, providing good insights, there has not been a survey that analyzes the evolution of research methods over time. There have also been few survey works that focuses on the connections between topics and methods of sentiment analysis, and their evolution over time.

2.4.3 From the point of view of the overall survey methodology

The survey methods used have mainly been the content analysis method, Kitchenham and Charters' guideline, and the informetric methods. As summarized in Table 1 , the content analysis method can effectively analyze the contents of research papers in depth, but it does not address the issue of the evolution of the research methods and topics (Bengtsson 2016 ; Birjali et al. 2021 ; Elo and Kyngäs 2008 ; Krippendorff 2018 ; Qazi et al. 2015 , 2017 ; Wankhade et al. 2022 ). Although the number of papers that need to be read in depth can be reduced by following Kitchenham and Charters' guideline, more effort is needed to search and screen literature than in traditional literature review methods (Angel et al. 2021 ; Kitchenham 2004 ; Kitchenham and Charters 2007 ; Kumar and Garg 2020 ; Ligthart et al. 2021 ; Lin et al. 2022 ; Obiedat et al. 2021 ; Sarsam et al. 2020 ; Zunic et al. 2020 ). The informetric methods are best suited to investigating the research methods and topics of sentiment analysis (Bar-Ilan 2008 ; Mäntylä et al. 2018 ; Piryani et al. 2017 ; Santos et al. 2019 ; Verma 2022 ). There are three surveys using informetric techniques and tools that are well suited for analysis of a large number of papers over many years (Mäntylä et al. 2018 ; Piryani et al. 2017 ; Verma 2022 ). However, the evolution of research methods and topics of sentiment analysis over time has not been studied with informetric methods. There have also been few survey works that leverages keyword co-occurrence analysis and community detection to analyze the connections between research methods and topics, and their evolution over time.

Therefore, to address the gaps in the existing surveys, this study presents a survey on the research methods and topics, and their evolution over time. It combines keyword co-occurrence analysis and informetric analysis tools to reveal the methods and topics of sentiment analysis and their evolution in this field from 2002 to 2022.

The following section, Sect.  3 , describes our proposed survey methodology in detail.

3 The proposed survey methodology

This section describes our proposed survey methodology, including collection of scientific publications, processing of scientific publications, as well as visualization and analysis using different methods and tools. The overall scheme of this survey (Fig.  2 ) is also presented in the end of Sect.  3 to better visualize and summarize the proposed survey methodology in this research.

3.1 Collection of scientific publications

We collected research data from the Web of Science platform. We used keywords such as "sentiment analysis," "sentiment mining," and "sentiment classification" to search for relevant papers as data samples. In examining the retrieved papers, we found that some paper topics, paper types, and publication journals were not related to sentiment analysis, so we excluded them. The papers we included were mainly related to the sentiment analysis of texts. We excluded papers on sentiment analysis related to image processing, video processing, speech processing, biological signal processing, etc. Therefore, the retrieval strategy was as follows:

Topic Search (TS) = ("sentiment analy*" or "sentiment mining" or "sentiment classification") And Abstract (AB) = "sentiment" NOT TS = ("face image*" or "speech recognition" or "speech emotion" or "physiological signal*" or "music emotion*" or "facial feature extraction" or "video emotion" or "electroencephalography " or "biosignal*" or "image process*") NOT Title = ("facial" or "speech" or "sound*" or "face" or "dance" or "temperature" or "image*" or "spoken" or "electroencephalography" or "EEG" or "biosignal*" or "voice*" not AB = "facial."

The results in conferences are given the same relevance as journal papers. We chose four databases in the Web of Science: two conference citation databases (Conference Proceedings Citation Index—Social Sciences & Humanities [CPCI-SSH], and Conference Proceedings Citation Index—Science [CPCI-S]), and two journal citation databases (Science Citation Index Expanded [SCI-Expanded] and Social Sciences Citation Index [SSCI]). Given the various forms of words such as "analyzing" and "analysis," a truncated search technique (marked with an asterisk) was used to prevent the omission of relevant papers. The time frame of the retrieved papers was from January 2002 to January 2022, and the publication types of the papers included "article," "conference paper," "review," and "edited material." A total of 9,714 papers were obtained from the four databases above. These included 3,809 articles, 5,633 proceeding papers, 267 reviews, and 5 pieces of editorial material from 2002 to 2022. Overall, there were 104 papers from January 2022. The number of papers each year from 2002 to 2021 is shown in Fig.  1 .

figure 1

The number of papers each year from 2002 to 2021

3.2 Processing of scientific publications

In this process, our purpose was to extract the key contents of the papers, which are used to analyze the research methods and topics in the field of sentiment analysis. Due to their limited number, the author keywords in each paper often cannot fully represent the key content of the paper. We found that combining the title and abstract could better reflect the core information. Therefore, we synthesized the title, abstract, and author keywords of each paper to extract keywords that represented the main research method and topic of the paper involved using KeyBERT Footnote 1 . KeyBERT is a keyword extraction technique that uses BERT embedding to create keywords and key phrases that most closely resemble document content (Grootendorst and Warmerdam 2021 ). The specific keyword extraction process was as follows:

First, we used KeyBERT to extract 8 keywords and eliminated keywords with a weight lower than 0.3. We then combined the extracted keywords with the author keywords and removed duplicates. After that, we standardized the whole collection of keywords and merged synonyms. Finally, we counted the number of keywords and removed meaningless terms like "sentiment analysis," "sentiment classification," and "sentiment mining."

After statistical analysis, we obtained 41,827 keywords with a total word frequency of 88,104. As there were 9,714 papers and 41,827 keywords, we found that most of the keywords with word frequency below 10 were not representative of the research contents of sentiment analysis. As a result, a total of 685 representative keywords were reserved for subsequent analysis. These keywords appeared a total of 30,801 times. Table 2 shows the keywords with word frequency in the top 50.

High-frequency keywords generally represent research hotspots. We therefore extracted high-frequency keywords to serve as the basis for the subsequent analysis. We found that most of the keywords with word frequency 18 and lower, such as "ranking," "mask," "experience," "affect," "online forum," and so on, were not relevant to sentiment analysis. Therefore, the keywords with a word frequency higher than 18 were reserved for analysis. These keywords appeared 25,429 times in the collected data, accounting for close to 83% of all the keywords. We obtained 275 keywords, which were used to analyze the main methods and topics of sentiment analysis.

3.3 Visualization and analysis using different methods and tools

3.3.1 analytical methods.

Keywords are the core natural language vocabulary to express the subject, content, ideas, and research methods of the literature (You et al. 2021 ). Keywords represent the topics of the domain, and cluster analysis of these words can reflect the structure and association of topics. Keyword co-occurrence analysis counts the number of occurrences of a set of keywords in the same document. The strength and number of associations between research contents can be obtained through keyword co-occurrence analysis. Dividing research methods and topics into sub-communities helps researchers to analyze hotspots and trends in methods and topics, as well as to obtain sub-fields of sentiment analysis research (Ding et al. 2001 ).

3.3.2 Visualization and analysis tools

BibExcel Footnote 2 is a software tool for analyzing bibliographic data or any text-based data formatted in a similar way (Persson 2017 ). The tool generates structured data files that can be read by Excel for subsequent processing (Persson et al. 2009 ). Our processing steps are as follows. First, we imported the standardized bibliographic data into BibExcel. This tool can help structure the data. Second, we checked and corrected the data and used BibExcel to count the number of co-occurrences of keywords.

We then used Pajek Footnote 3 software to visualize the keyword co-occurrence network and divided the sub-communities. Pajek is a large and complex network analysis tool (Batagelj and Andrej 2022 ; Batagelj and Mrvar 1998 ). It can calculate certain indicators to reveal the state and properties of the network involved. In addition, Pajek’s Louvain community detection algorithm can help divide the keyword co-occurrence network into sub-communities, which represent sub-fields of sentiment analysis (Blondel et al. 2008 ; Leydesdorff et al. 2014 ; Rotta and Noack 2011 ). The Louvain community-detection algorithm unfolds a complete hierarchical community structure for the network. It has an advantage in subdividing different areas of study: multiple knowledge structures and details can be shown in one network (Deng et al. 2021 ).

After that, we applied VOSviewer Footnote 4 to optimize the visualization of sub-communities (Van Eck and Waltman 2010 ; VOSviewer 2021 ; Perianes-Rodriguez et al. 2016 ; Waltman and Van Eck 2013 ; Waltman et al. 2010 ). VOSviewer can help display the core keywords in each sub-community and the correlation between keywords. It can also reflect the closeness of the association between sub-communities. Finally, we used Excel to count the frequency of keywords for each year and to map the evolution of research methods and topics in the field of sentiment analysis.

3.3.3 Graphical representation of the overall scheme of this survey

This paper proposes and conducts a new research survey on sentiment analysis. The graphical representation of the overall scheme of this survey is shown in Fig.  2 . The main scheme includes four modules: Module A, Collection of scientific publications; Module B, Processing of scientific publications; Module C, Visualization and analysis through different methods and tools, and Module D, Result analysis and discussions based on various aspects.

figure 2

Graphical representation of the overall scheme of this survey. Module A: Collection of scientific publications; Module B: Processing of scientific publications; Module C: Visualization and analysis using different methods and tools; Module D: Result analysis and discussions considering various aspects

In Module A, scientific publications are collected from the Web of Science (WOS) platform, as has been detailed in Sect.  3.1 Collection of scientific publications above. Module B, Processing of scientific publications, has been detailed in Sect.  3.2 above. It performs a data processing procedure to obtain key information, which includes all the representative keywords and high-frequency keywords. The title, abstract and keywords of the papers are used to extract such key information using KeyBERT (Grootendorst and Warmerdam 2021 ). Such key information is analyzed and visualized through different methods, including different visualization tools, as introduced in Sect.  3.3 (Module C), Visualization and analysis using different methods and tools, above.

In Module C, the number of co-occurrences of keywords is obtained using BibExcel (Persson 2017 ), the co-occurrences of keywords are analyzed and visualized using Pajek (Blondel et al. 2008 ; Leydesdorff et al. 2014 ; Rotta and Noack 2011 ) and VOSviewer (Van Eck and Waltman 2010 ; VOSviewer 2021 ; Perianes-Rodriguez et al. 2016 ; Waltman and Van Eck 2013 ; Waltman et al. 2010 ). The keyword community network and the keyword community evolution are analyzed and visualized using these tools, as described in Sect.  3.3 (Module C), Visualization and analysis using different methods and tools. According to the visualization and analysis results obtained in Module C, Module D, Result analysis and discussions, will be detailed in Sect.  4 .

In the following section, Sect.  4 (Module D), results are analyzed and discussed considering various aspects, including the research methods and topics of sentiment analysis in each community, the evolution of research methods and topics along with the research hotspots and trends over time.

4 Results and analysis through various aspects

4.1 research methods and topics of sentiment analysis, 4.1.1 overall characteristic analysis.

The high-frequency keywords were presented in Table 2 . These keywords can be regarded as the main research contents in the field of sentiment analysis. "Twitter" ranks at the top. It is followed by "opinion mining," "natural language processing," "machine learning," and so on. The high-frequency keywords cover the topics of the studies, the contents of the studies, and the techniques and methods used. Based on these keywords, we used Pajek’s Louvain method to construct a keyword co-occurrence network to represent the research methods and topics as shown in Fig.  3 . The keyword co-occurrence network is divided into six communities. The research methods and topics of the six communities include social media platforms (C1), machine learning methods (C2), natural language processing and deep learning methods (C3), opinion mining and text mining (C4), Arabic sentiment analysis (C5), and others, such as domain sentiment analysis and transfer learning, etc. (C6).

figure 3

Keyword community network

In Fig.  3 , the size of the node represents the number of keywords. The thickness of the line between the nodes represents the number of collaborations between keywords. The top 20 keywords in each community are sorted in descending order, as shown in Table 3 . The keyword co-occurrence network features of the six sub-communities are described in Table 4 . The number of nodes shows the number of keywords in each community, and the number of links shows the correlations between the keywords.

As shown in Table 4 , we can see from the number of links between sub-communities that there is a strong correlation between them, especially the link between C3 and C4, which has 1306 lines. The reason may be that the research methods of C4 focus on "opinion mining" and "text mining," while those of C3 focus on "natural language processing" and "deep learning," and C3 provides more technical support for C4 research. In C5 and C6, the research methods and topics are scattered. Their internal links are also low, but the connections with C3 and C4 are relatively high. The contents of C5 and C6 may include some emerging research methods and topics. We will present a specific analysis on the methods and topics of each sub-community in the next subsection.

4.1.2 Analysis on research methods and topics of sub-communities

4.1.2.1 analysis on research methods and topics of the c1 community.

Figure  4 shows the keyword co-occurrence network of the C1 community. The research methods and topics of the C1 community focus on three areas: "social media," "topic models," and "covid-19." In the context of big data, web 2.0 technology provides users with a way to express reviews and opinions of services, events, and people. Various social media platforms, such as Twitter, YouTube, and Weibo, have a large amount of users’ emotional data (Momtazi 2012 ). Compared to traditional news media, information on social media spreads more quickly, and people are able to express their feelings more freely. It is important to analyze the emotions generated by the information shared and published on social media (Abdullah and Zolkepli 2017 ; Wang et al. 2014 ). Researchers have been extracting text data from social media platforms for years to detect unexpected events (Bai and Yu 2016 ; Preethi et al. 2015 ), improve the quality of products (Abrahams et al. 2012 ; Isah et al. 2014 ; Myslin et al. 2013 ), understand the direction of public opinion (Fink et al. 2013 ; Groshek and Al-Rawi 2013 ), and so on.

figure 4

The keyword co-occurrence network for the C1 community

Users’ sentiments are often associated with the topics, and the accuracy of sentiment analysis can be improved through the introduction of topic models (Li et al. 2010 ). Among them, the Latent Dirichlet Allocation (LDA) method is cited most frequently. Previous studies found that the LDA method can be effective in subdividing topics and identifying the sentiments of the contents. This method is quite general, and there are also many improved models based on this one that can be applied to any type of web text, helping to enhance the accuracy of sentiment polarity calculation (Chen et al. 2019 ; Liu et al. 2020 ).

As the COVID-19 pandemic has unfolded, a large number of individuals, media and governments have been publishing news and opinions about the COVID-19 crisis on social media platforms. This has resulted in a lot of sentiment analysis studies focusing on COVID-19-related texts exploring the impact of the epidemic on people’s lives (Sari and Ruldeviyani 2020 ; Wang, T. et al. 2020a ), physical health (Berkovic et al. 2020 ; Binkheder et al. 2021 ) and mental health (Yin et al. 2020 ), and so on. Therefore, we can see many related keywords, such as "infodemiology," "healthcare," and "mental health."

4.1.2.2 Analysis on research methods and topics of the C2 community

The contents of the C2 community mainly focus on "machine learning," "text classification," "feature extraction," and "stock market" (see Fig.  5 ). Most keywords are related to the research methods of sentiment analysis. Machine learning approaches have expanded from topic recognition to more challenging tasks such as sentiment classification. It is very important to explore and compare machine learning methods applied to sentiment classification (Li and Sun 2007 ). Methods like Support Vector Machine (SVM) and Naive Bayes models are widely used (Altrabsheh et al. 2013 ; Dereli et al. 2021 ; Shofiya and Abidi 2021 ; Tan et al. 2009 ; Wang and Lin 2020 ) and are used as benchmarks for the comparisons of models proposed by many researchers (Kumar et al. 2021 ; Sadamitsu et al. 2008 ; Waila et al. 2012 ; Zhang et al. 2019 ). Many algorithms, such as random forest (Al Amrani et al. 2018 ; Fitri et al. 2019 ; Sutoyo et al. 2022 ), tf-idf (Arafin Mahtab et al. 2018 ; Awan et al. 2021 ; Dey et al. 2017 ), logistic regression (Prabhat and Khullar 2017 ; Qasem et al. 2015 ; Sutoyo et al. 2022 ), and n-gram (Ikram and Afzal 2019 ; Singh and Kumari 2016 ; Xiong et al. 2021 ) are used to enhance the accuracy of machine learning, as shown in Fig.  5 .

figure 5

The keyword co-occurrence network for the C2 community

The trading volume and asset prices of financial commodities or financial instruments are influenced by a variety of factors in the online environment. Machine learning and sentiment analysis are powerful tools that can help gather vast amounts of useful information to predict financial risk effectively (Li et al. 2009 ). Research on the relationship between public sentiment and stock prices has always been the focus of many scholars (Smailović et al. 2014 ; Xing et al. 2018 ). They have used machine learning methods to explore the influence of sentiments on stock prices through sentiment analysis of news articles, and then predicted the trend changes in the stock market (Ahuja et al. 2015 ; Januário et al. 2022 ; Maqsood et al. 2020 ; Picasso et al. 2019 ).

4.1.2.3 Analysis on research methods and topics of the C3 community

The contents of the C3 community also mainly focus on the methods for sentiment analysis, like "natural language processing", "deep learning," "aspect-based sentiment analysis," and "task analysis" (Fig.  6 ). Sentiment analysis is a sub-field of natural language processing (Nicholls and Song 2010 ), and natural language processing techniques have been widely used in sentiment analysis. Using natural language processing technology can help to better parse text features, such as part-of-speech tagging, word sense disambiguation, keyword extraction, inter-word dependency recognition, semantic parsing, and dictionary construction (Abbasi et al. 2011 ; Syed et al. 2010 ; Trilla and Alías 2009 ). With the rise of deep learning technology, researchers began to introduce it to sentiment analysis. Neural network models like LSTM (Al-Dabet et al. 2021 ; Al-Smadi et al. 2019 ; Li and Qian 2016 ; Schuller et al. 2015 ; Tai et al. 2015 ), CNN (Cai and Xia 2015 ; Jia and Wang 2022 ; Ouyang et al. 2015 ), RNN (Hassan and Mahmood 2017 ; Tembhurne and Diwan 2021 ; You et al. 2016 ), and some combination of these, as well as other models (An and Moon 2022 ; Li et al. 2022 ; Liu et al. 2020a ; Salur and Aydin 2020 ; Zhao et al. 2021 ), have received significant attention.

figure 6

The keyword co-occurrence network for the C3 community

Sentiment analysis granularity is subdivided into document level, sentence level, and aspect level. Document-level sentiment analysis takes the entire document as a unit, but the premise is that the document needs to have a clear attitude orientation—that is, the point of view needs to be clear (Shirsat et al. 2018 ; Wang and Wan 2011 ). Sentence-level sentiment analysis is intended to perform sentiment analysis of the sentences in the document alone (Arulmurugan et al. 2019 ; Liu et al. 2009 ; Nejat et al. 2017 ). Aspect-based analysis is a fundamental and significant task in sentiment analysis. The aim of aspect-level sentiment analysis is to separately summarize positive and negative views about different aspects of a product or entity, although overall sentiment toward a product or entity may tend to be positive or negative (Rao et al. 2021 ; Thet et al. 2010 ). Aspect-level sentiment analysis facilitates a more finely-grained analysis of sentiment than either document or sentence-level analysis (Liang et al. 2022 ; Wang et al. 2020c ). The traditional levels of analysis, such as sentence-level analysis can only calculate the comprehensive sentiment polarity of paragraphs or sentences (Wang et al. 2016 ; Zhang et al. 2021 ). In recent years, the aspect level has become more and more popular, and with the application of deep learning technology, it has become better at capturing the semantic relationship between aspect terms and words in a more quantifiable way (Huang et al. 2018 ). The process of sentiment analysis involves the coordination of multiple tasks, and the subtasks include feature extraction (Bouktif et al. 2020 ; Lin et al. 2020 ), context analysis (Yu et al. 2019 ; Zuo et al. 2020 ), and the application of some analytical models (Tan et al. 2020 ).

4.1.2.4 Analysis on research methods and topics of the C4 community

The C4 community mainly shows keywords related to the research methods and topics of "opinion mining" and "user review," which is the largest of the six sub-communities (Fig.  7 ). With the popularity of platforms like online review sites and personal blogs on the Internet, opinions and user reviews are readily available on the web. Opinion mining has always been a hot field of research (Khan et al. 2009 ; Poria et al. 2016 ). From Table 4 , we can see that the link between C3 and C4 has 1306 lines. In opinion mining, researchers use many text mining methods to discover users’ opinions on goods or services, and then help improve the quality of corresponding products or services (Da’u et al. 2020 ; Lo and Potdar 2009 ; Martinez-Camara et al. 2011 ). In addition, scholars have found that the consideration of user opinions can help improve the overall quality of recommender systems (Artemenko et al. 2020 ; Da’u et al. 2020 ; Garg 2021 ; Malandri et al. 2022 ). Therefore, "recommendation system" has a strong correlation with "opinion mining."

figure 7

The keyword co-occurrence network for C4 community

Evaluation metrics for quantifying the existing approaches are also a popular topic related to opinion mining. There is a keyword named "performance sentiment" in the C4 community. Precision, recall, accuracy and F1-score are the most commonly used evaluation metrics (Dangi et al. 2022 ; Jain et al. 2022 ; JayaLakshmi and Kishore 2022 ; Li et al. 2017 ; Wang et al. 2021 ; Yi and Niblack 2005 ). Some researchers have also used runtimes to calculate the model efficiency (Abo et al. 2021 ; Ferilli et al. 2015 ), p-value to statistically evaluate the relationship or difference between two samples of classification results (JayaLakshmi and Kishore 2022 ; Salur and Aydin 2020 ), paired sample t-tests to verify that the results are not obtained by chance (Nhlabano and Lutu 2018 ), and standard deviation to measure the stability of the model (Chang et al. 2020 ). There have also been researchers who have used G-mean (Wang et al. 2021 ), Pearson Correlation Coefficient (Corr) (Yang et al. 2022 ), Mean Absolute Error (MAE) (Yang et al. 2022 ), Normalized Information Transfer (NIT) and Entropy-Modified Accuracy (EMA) (Valverde-Albacete et al. 2013 ), Mean Squared Error (MSE) (Mao et al. 2022 ), Hamming loss (Liu and Chen 2015 ), Area Under the Curve (AUC) (Abo et al. 2021 ), sensitivity and specificity (Thakur and Deshpande 2019 ), etc.

4.1.2.5 Analysis on research methods and topics of the C5 & C6 communities

Both sub-communities C5 (Fig.  8 ) and C6 (Fig.  9 ) are small in size. The C5 community has 25 nodes and the C6 community has 41 nodes. The core content of the C5 community is "Arabic sentiment analysis." Before 2011, most resources and systems built in the field of sentiment analysis were tailored to English and other Indo-European languages. It is increasingly necessary to design sentiment analysis systems for other languages (Korayem et al. 2012 ), and researchers are increasingly interested in the study of tweets and texts in the Arabic language (Heikal et al. 2018 ; Khasawneh et al. 2013 ; Oueslati et al. 2020 ). They use technologies such as named entity recognition (Al-Laith and Shahbaz 2021 ), deep learning (Al-Ayyoub et al. 2018 ; Heikal et al. 2018 ), and corpus construction (Alayba et al. 2018 ) to enhance the accuracy of sentiment analysis.

figure 8

The keyword co-occurrence network for the C5 community

figure 9

The keyword co-occurrence network for the C6 community

The contents of the C6 community are not very concentrated. From the size of the circle, we can see that the keywords "domain adaptation"(Blitzer et al. 2007 ; Glorot et al. 2011 ), "domain sentiment," and "cross-domain" appear more frequently. Cross-domain sentiment classification is intended to address the lack of mass labeling data (Du et al. 2020a ). It has attracted much attention (Du et al. 2020b ; Hao et al. 2019 ; Yang et al. 2020b ). Advances in communication technology have provided valuable interactive resources for people in different regions, and the processing of multilingual user comments has gradually become a key challenge in natural language processing (Martinez-Garcia et al. 2021 ). Therefore, some keywords related to "lingual" have appeared. Other keywords, such as "transfer learning," "active learning," and "semi-supervised learning," are mainly related to sentiment analysis technologies.

4.2 Evolution of research methods and topics of sentiment analysis

4.2.1 overall evolution analysis.

Annual changes in keyword frequency in sentiment analysis research can reflect the evolution of research methods and topics in this field. Based on the keyword community network (Fig.  3 ), we counted the frequency of keywords in each sub-community for each year. The keyword community evolution diagram is shown in Fig.  10 . Since there were fewer papers published before 2006, we combined the occurrences of keywords from 2002 to 2006. We can see that the C1 community and the C3 community have shown a significant growth trend. The C2 community was in a state of growth until 2019, and the frequency of keywords decreased year by year after 2019. The frequency of C4 community keywords continued to increase until 2018 and declined after 2018. The number of keywords in the C5 community and in the C6 community both had a slow growth trend, but the trend was not obvious.

figure 10

Keyword community evolution diagram

4.2.2 Evolution analysis of sub-communities

We selected the high-frequency keywords under each category and plotted the change of word frequency in each year, as shown in Figs.  11 and 12 . In the C1 community, "social medium," "Twitter," "social network," "covid-19," "Latent Dirichlet Allocation," "topic model," and "text analysis" all had significant increases in word frequency, and the growth trend in 2021 was obvious. "Covid-19" appears in 2020, and the word frequency increased rapidly in 2021. Social media platforms have always been the focus of researchers’ attention. Under the influence of COVID-19, more people express their emotions, stress, and thoughts through social media platforms. Sentiment analysis on data from social media platforms related to COVID-19 has become a hot topic (Boon-Itt and Skunkan 2020 ). We believe that due to the impact of COVID-19, the widespread use of social platforms in 2020–2021 has led to a surge in the number of C1-related keywords.

figure 11

C1, C2, C5, C6 communities: High-frequency keyword evolution diagram

figure 12

C3, C4 communities: High-frequency keyword evolution diagram

The C2 community focuses on the method of "machine learning," and the C3 community focuses on the methods of "deep learning" and "natural language processing." The keywords in the two communities are mainly related to the techniques and methods of sentiment analysis. We have found that before 2016 (Fig.  10 ), the frequency of keywords in the C2 community was higher than that in the C3 community, and in 2016 and later, the frequency of keywords in the C3 community gradually accounted for a larger proportion of the total. This reflects the fact that deep learning-related technologies and methods have become a research hotspot, and the attention given to SVM, Naive Bayes, supervised learning, and other technologies in machine learning has declined. In addition to deep learning models such as Bi-LSTM, Long Short-term Memory, and recurrent neural network in the C3 community, the number of "aspect based" and "feature extraction" keywords have also been growing, which shows that researchers now pay more attention to the aspect level of text granularity in the field of sentiment analysis.

Among the keywords found in the C4 community, the word frequency of the "opinion mining" keyword has decreased since 2018. This shows that in the field of sentiment analysis, researchers have begun to reduce the attention they give to sentiment analysis of opinions on product or service quality, while still maintaining a certain degree of attention to "user review" and "online review." In addition, the number of keywords for "sentiment lexicon" and "lexicon-based" has declined. It may be because, in the context of the widespread application of deep learning technology in recent years, the lexicon-based method requires more time and higher labor costs (Kaity and Balakrishnan 2020 ). However, its accuracy still attracts attention due to the high involvement of experts, especially in non-English languages (Bakar et al. 2019 ; Kydros et al. 2021 ; Piryani et al. 2020 ; Tammina 2020 ; Xing et al. 2019 ; Yurtalan et al. 2019 ).

The high-frequency keywords in the C5 and C6 communities are "Arabic language," "Arabic sentiment analysis," and "transfer learning." Arabic has 30 variants, including the official Modern Standard Arabic (MSA) (ISO 639–3 2017). Arabic dialects are becoming increasingly popular as the language of informal communication on blogs, forums, and social media networks (Lulu and Elnagar 2018 ). This makes them challenging languages for natural language processing and sentiment analysis (Alali et al. 2019 ; Elshakankery and Ahmed 2019 ; Sayed et al. 2020 ). Transfer learning can solve the problem by leveraging knowledge obtained from a large-scale source domain to enhance the classification performance of target domains (Heaton 2018 ). In recent years, based on the success of deep learning technology, this method has gradually attracted attention.

5 Research hotspots and trends

Through the analysis in Sects.  4.1 and 4.2 , we found that the research methods and topics of sentiment analysis are constantly changing. The keyword topic heat map is shown in Fig.  13 . From this map, we can see that in the past two decades, research hotspots have included social media platforms (such as "social medium," "social network," and "Twitter"); sentiment analysis techniques and methods (such as "machine learning," "svm," "natural language processing," "deep learning," "aspect-based," "text mining," and "sentiment lexicon"), mining of user comments or opinions (e.g., "opinion mining," "user review," and "online review"), and sentiment analysis for non-English languages (e.g., "Arabic sentiment analysis" and "Arabic language").

figure 13

Keyword topic heat map

With the popularity of digitization, a large amount of user-generated content has appeared on the Internet, where users express their opinions and comments on different topics such as the news, events, activities, products, services, etc. through social media. This is especially so in the case of the Twitter mobile platform, launched in 2006, which has become the most popular social channel (Kumar and Jaiswal 2020 ). However, online text data is mostly unstructured. In order to accurately analyze users’ sentiments, the research methods for sentiment analysis, such as natural language processing technology, and automatic sentiment analysis models have become the focus of researchers’ works. From Fig.  11 , we can see that early technologies and methods are dominated by machine learning and that SVM and Naive Bayes have always been favored by researchers. This has also been confirmed in studies by Neha Raghuvanshi (Raghuvanshi and Patil 2016 ), Harpreet Kaur (Kaur et al. 2017 ), and Marouane Birjali (Birjali et al. 2021 ). With the improvement of neural network and artificial intelligence technology, deep learning technology has been widely used in sentiment analysis, and has resulted in good outcomes (Basiri et al. 2021 ; Ma et al. 2018 ; Prabha and Srikanth 2019 ; Yuan et al. 2020 ). However, deep learning technology still has room for improvement, and the hybrid methods combining sentiment dictionary and semantic analysis are gradually becoming a trend (Prabha and Srikanth 2019 ; Yang et al. 2020a ).

The granularity of sentiment analysis ranges from the early text level to the sentence level and finally to the aspect level, which is currently gaining strong attention. The granularity of sentiment analysis is gradually being refined, but the method is immature at present, and further research work in the future is needed (Agüero-Torales et al. 2021 ; Li et al. 2020 ; Trisna and Jie 2022 ).

Early sentiment analysis was mainly in the English language. In recent years, non-English languages such as Chinese (Lai et al. 2020 ; Peng et al. 2018 ), French (Apidianaki et al. 2016 ; Pecore and Villaneau 2019 ), Spanish (Chaturvedi et al. 2016 ; Plaza-del-Arco et al. 2020 ), Russian (Smetanin 2020 ), and Arabic (Alhumoud and Al Wazrah 2022 ; Ombabi et al. 2020 ) have attracted more and more attention. Furthermore, cross-domain sentiment analysis technology is in urgent need of research and discussion by researchers (Liu et al. 2019 ; Singh et al. 2021 ).

6 Conclusion and future work

6.1 conclusion.

Judging from the increasing number of papers related to sentiment analysis research every year, sentiment analysis has been on the rise. Although there are many surveys on sentiment analysis research, there has not been a survey dedicated to the evolution of research methods and topics of sentiment analysis. This paper has used keyword co-occurrence analysis and the informetric tools to enrich the perspectives and methods of previous studies. Its aims have been to outline the evolution of the research methods and tools, research hotspots and trends and to provide research guidance for researchers.

By adopting keyword co-occurrence analysis and community detection methods, we analyzed the research methods and topics of sentiment analysis, as well as their connections and evolution trends, and summarized the research hotspots and trends in sentiment analysis. We found that research hotspots include social media platforms, sentiment analysis techniques and methods, mining of user comments or opinions, and sentiment analysis for non-English languages. Moreover, deep learning technology, with its hybrid methods combining sentiment dictionary and semantic analysis, fine-grained sentiment analysis methods, and non-English language analysis methods, and cross-domain sentiment analysis techniques have gradually become the research trends.

6.2 Practical implications and technical directions of sentiment analysis

Sentiment analysis has a wide range of application targets, such as e-commerce platforms, social platforms, public opinion platforms, and customer service platforms. Years of development have led to many related tasks in sentiment analysis, such as sentiment analysis of different text granularity, sentiment recognition, opinion mining, dialogue sentiment analysis, irony recognition, false information detection, etc. Such analysis can help structure user reviews, support product improvement decisions, discover public opinion hotspots, identify public positions, investigate user satisfaction with products, and so on. As long as user-generated content is involved, sentiment analysis technology can be used to mine the emotions of human actors associated with the content. The improvement of sentiment analysis technology can help machines better understand the thoughts and opinions of users, make machines more intelligent, and make better decisions for policy leaders, businessmen, and service people. However, most of the current sentiment analysis methods are based on sentiment dictionaries, sentiment rules, statistics-based machine learning models, neural network-based deep learning models, and pre-training models, and have yet to achieve true language understanding in the sense of comprehension at the deep semantic level, though this does not prevent them from being useful in certain practical applications.

As an important task in natural language understanding, sentiment analysis has received extensive attention from academia and industry. Coarse-grained sentiment analysis is increasingly unable to meet people's decision-making needs, and for aspect-level sentiment analysis and complex tasks, pure machine learning is still unable to flexibly achieve true language understanding. Once the scene or domain changes, problems such as the domain incompatibility of the sentiment dictionary and the low transfer effect of the model involved keep appearing. At present, the accuracy of sentiment analysis provided by machines is far less than that of humans. To achieve human-like performance for machines, we believe that it is necessary to incorporate human commonsense knowledge and domain knowledge, as well as grounded definitions of concepts, in order for machines to understand natural language at a deeper level. These, combined with rules for affective reasoning to supplement interpretable information, will be effective in improving the performance of sentiment analysis. Future research in this direction can be strengthened to achieve true language understanding in machines.

6.3 Limitations and future work

There are some research limitations in this paper. First, we only studied papers written in English and searched from the Web of Science platform. We believe there are papers in other languages or other databases (e.g., Scopus, PubMed, Sci-hub, etc.) that also involve sentiment analysis but that were not included in our study. In addition, the keywords we chose to search in the Web of Science were mainly "sentiment analysis," "sentiment mining," and "sentiment classification." There may be papers related to our research topic that do not have these keywords. To track developments in sentiment analysis research, future studies could replicate this work by employing more precise keywords and using different literature databases.

Second, we selected the main high-frequency keywords for analysis, and some important low-frequency keywords may have been ignored. In future work, we can analyze the changes in each keyword in detail from the perspective of time and obtain more comprehensive analysis results.

Third, the results show that the themes of sentiment analysis cover many fields, such as computer science, linguistics, and electrical engineering, which indicates the trend of interdisciplinary research. Therefore, future work should apply co-citation and diversity measures to explore the interdisciplinary nature of sentiment analysis research.

Data availability

The data that support the findings of this study are available from the corresponding author upon reasonable request.

https://github.com/MaartenGr/KeyBERT .

https://homepage.univie.ac.at/juan.gorraiz/bibexcel/ .

http://mrvar.fdv.uni-lj.si/pajek/ .

https://www.vosviewer.com/ .

Abbasi A, France S, Zhang Z, Chen H (2011) Selecting attributes for sentiment classification using feature relation networks. IEEE Trans Knowl Data Eng 23(3):447–462. https://doi.org/10.1109/TKDE.2010.110

Article   Google Scholar  

Abdullah NSD, Zolkepli IA (2017) Sentiment analysis of online crowd input towards Brand Provocation in Facebook, Twitter, and Instagram. In: Proceedings of the international conference on big data and internet of thing, association for computing machinery, pp 67–74. https://doi.org/10.1145/3175684.3175689

Abo MEM, Idris N, Mahmud R, Qazi A, Hashem IAT, Maitama JZ et al (2021) A multi-criteria approach for Arabic dialect sentiment analysis for online reviews: exploiting optimal machine learning algorithm selection. Sustainability 13(18):10018. https://doi.org/10.3390/su131810018

Abrahams AS, Jiao J, Wang GA, Fan W (2012) Vehicle defect discovery from social media. Decis Support Syst 54(1):87–97. https://doi.org/10.1016/j.dss.2012.04.005

Acheampong FA, Nunoo-Mensah H, Chen W (2021) Transformer models for text-based emotion detection: a review of BERT-based approaches. Artif Intell Rev 54(8):5789–5829. https://doi.org/10.1007/s10462-021-09958-2

Adak A, Pradhan B, Shukla N (2022) Sentiment analysis of customer reviews of food delivery services using deep learning and explainable artificial intelligence: systematic review. Foods 11(10):1500. https://doi.org/10.3390/foods11101500

Agüero-Torales MM, Salas JIA, López-Herrera AG (2021) Deep learning and multilingual sentiment analysis on social media data: an overview. Appl Soft Comput 107:107373. https://doi.org/10.1016/j.asoc.2021.107373

Ahuja R, Rastogi H, Choudhuri A, Garg B (2015) Stock market forecast using sentiment analysis. In: 2015 2nd International conference on computing for sustainable global development, INDIACom 2015, Bharati Vidyapeeth, New Delhi, pp 1008–1010. https://doi.org/10.48550/arXiv.2204.05783

Ain QT, Ali M, Riaz A, Noureen A, Kamranz M, Hayat B et al (2017) Sentiment analysis using deep learning techniques: a review. Int J Adv Comput Sci Appl 8(6):424–433. https://doi.org/10.14569/ijacsa.2017.080657

Al-Ayyoub M, Nuseir A, Alsmearat K, Jararweh Y, Gupta B (2018) Deep learning for Arabic NLP: a survey. J Comput Sci 26:522–531. https://doi.org/10.1016/j.jocs.2017.11.011

Al-Ayyoub M, Khamaiseh AA, Jararweh Y, Al-Kabi MN (2019) A comprehensive survey of Arabic sentiment analysis. Inf Process Manag 56(2):320–342. https://doi.org/10.1016/j.ipm.2018.07.006

Al-Dabet S, Tedmori S, AL-Smadi M (2021) Enhancing Arabic aspect-based sentiment analysis using deep learning models. Comput Speech Lang 69:1224. https://doi.org/10.1016/j.csl.2021.101224

Al-Laith A, Shahbaz M (2021) Tracking sentiment towards news entities from Arabic news on social media. Futur Gener Comput Syst 118:467–484. https://doi.org/10.1016/j.future.2021.01.015

Al-Smadi M, Talafha B, Al-Ayyoub M, Jararweh Y (2019) Using long short-term memory deep neural networks for aspect-based sentiment analysis of Arabic reviews. Int J Mach Learn Cybern 10(8):2163–2175. https://doi.org/10.1007/s13042-018-0799-4

Alali M, Sharef NM, Murad MAA, Hamdan H, Husin NA (2019) Narrow convolutional neural network for Arabic dialects polarity classification. IEEE Access 7:96272–96283. https://doi.org/10.1109/ACCESS.2019.2929208

Alamoodi AH, Zaidan BB, Al-Masawa M, Taresh SM, Noman S, Ahmaro IYY et al (2021a) Multi-perspectives systematic review on the applications of sentiment analysis for vaccine hesitancy. Comput Biol Med 139:104957. https://doi.org/10.1016/j.compbiomed.2021.104957

Alamoodi AH, Zaidan BB, Zaidan AA, Albahri OS, Mohammed KI, Malik RQ et al (2021b) Sentiment analysis and its applications in fighting COVID-19 and infectious diseases: a systematic review. Expert Syst Appl 167:114155. https://doi.org/10.1016/j.eswa.2020.114155

Alayba AM, Palade V, England M, Iqbal R (2018) Improving sentiment analysis in arabic using word representation. In: 2018 IEEE 2nd International Workshop on Arabic and Derived Script Analysis and Recognition (ASAR), IEEE, pp 13–18. https://doi.org/10.1109/ASAR.2018.8480191

Alhumoud SO, Al Wazrah AA (2022) Arabic sentiment analysis using recurrent neural networks: a review. Artif Intell Rev 55(1):707–748. https://doi.org/10.1007/s10462-021-09989-9

Alonso MA, Vilares D, Gómez-Rodríguez C, Vilares J (2021) Sentiment analysis for fake news detection. Electronics 10(11):1348. https://doi.org/10.3390/electronics10111348

Altrabsheh N, Gaber MM, Cocea M (2013) SA-E: sentiment analysis for education. In: The 5th KES International Conference on Intelligent Decision Technologies (KES-IDT), Sesimbra, Portugal, pp 353–362. https://doi.org/10.3233/978-1-61499-264-6-353

Al Amrani Y, Lazaar M, El Kadirp KE (2018) Random forest and support vector machine based hybrid approach to sentiment analysis. Procedia Comput Sci 127:511–520. https://doi.org/10.1016/j.procs.2018.01.150

An H, Moon N (2022) Design of recommendation system for tourist spot using sentiment analysis based on CNN-LSTM. J Ambient Intell Hum Comput 13:1653–1663. https://doi.org/10.1007/s12652-019-01521-w

Angel SO, Negron APP, Espinoza-Valdez A (2021) Systematic literature review of sentiment analysis in the spanish language. Data Technol Appl 55(4):461–479. https://doi.org/10.1108/DTA-09-2020-0200

Apidianaki M, Tannier X, Richart C (2016) Datasets for aspect-based sentiment analysis in French. In: Proceedings of the tenth international conference on language resources and evaluation (LREC’16), Portorož, Slovenia: European Language Resources Association (ELRA), pp 1122–1126. https://aclanthology.org/L16-1179

Arafin Mahtab S, Islam N, Mahfuzur Rahaman M (2018) Sentiment analysis on Bangladesh cricket with support vector machine. In: 2018 International conference on Bangla Speech and language processing (ICBSLP), IEEE, pp 1–4. https://doi.org/10.1109/ICBSLP.2018.8554585

Artemenko O, Pasichnyk V, Kunanets N, Shunevych K (2020) Using sentiment text analysis of user reviews in social media for E-Tourism mobile recommender systems. In: COLINS, CEUR-WS, Aachen, pp 259–271. http://ceur-ws.org/Vol-2604/paper20.pdf

Arulmurugan R, Sabarmathi KR, Anandakumar H (2019) Classification of sentence level sentiment analysis using cloud machine learning techniques. Clust Comput 22(1):1199–1209. https://doi.org/10.1007/s10586-017-1200-1

Asghar MZ, Khan A, Ahmad S, Kundi FM (2014) A review of feature selection techniques in sentiment analysis. J Basic Appl Sci Res 4(3):181–186. https://doi.org/10.3233/IDA-173763

Awan MJ, Yasin A, Nobanee H, Ali AA, Shahzad Z, Nabeel M et al (2021) Fake news data exploration and analytics. Electronics 10(19):2326. https://doi.org/10.3390/electronics10192326

Bai H, Yu G (2016) A Weibo-based approach to disaster informatics: incidents monitor in post-disaster situation via weibo text negative sentiment analysis. Nat Hazards 83(2):1177–1196. https://doi.org/10.1007/s11069-016-2370-5

Bakar MFRA, Idris N, Shuib L (2019) An enhancement of Malay social media text normalization for Lexicon-based sentiment analysis. In: 2019 International conference on Asian language processing (IALP), IEEE, pp 211–215. https://doi.org/10.1109/IALP48816.2019.9037700

Bar-Ilan J (2008) Informetrics at the beginning of the 21st century—a review. J Informet 2(1):1–52. https://doi.org/10.1016/j.joi.2007.11.001

Basiri ME, Nemati S, Abdar M, Cambria E, Acharya UR (2021) ABCDM: an attention-based bidirectional CNN-RNN deep model for sentiment analysis. Futur Gener Comput Syst 115:279–294. https://doi.org/10.1016/j.future.2020.08.005

Batagelj V, Andrej M (2022) Pajek [Software]. http://mrvar.fdv.uni-lj.si/pajek/

Batagelj V, Mrvar A (1998) Pajek-program for large network analysis eds. M. Jünger and P Mutzel. Connections 21(2): 47–57. http://vlado.fmf.uni-lj.si/pub/networks/doc/pajek.pdf

Bengtsson M (2016) How to plan and perform a qualitative study using content analysis. NursingPlus Open 2:8–14. https://doi.org/10.1016/j.npls.2016.01.001

Berkovic D, Ackerman IN, Briggs AM, Ayton D (2020) Tweets by people with arthritis during the COVID-19 pandemic: content and sentiment analysis. J Med Internet Res 22(12):e24550. https://doi.org/10.2196/24550

Binkheder S, Aldekhyyel RN, Almogbel A, Al-Twairesh N, Alhumaid N, Aldekhyyel SN et al (2021) Public perceptions around Mhealth applications during Covid-19 pandemic: a network and sentiment analysis of tweets in Saudi Arabia. Int J Environ Res Public Health 18(24):1–22. https://doi.org/10.3390/ijerph182413388

Birjali M, Kasri M, Beni-Hssane A (2021) A comprehensive survey on sentiment analysis: approaches, challenges and trends. Knowl Based Syst 226:107134. https://doi.org/10.1016/j.knosys.2021.107134

Blitzer J, Dredze M, Pereira F (2007) Biographies, bollywood, boom-boxes and blenders: domain adaptation for sentiment classification. In: 45th Annual Meeting of the association of computational linguistics, association for computational linguistics, pp 440–447. https://doi.org/10.1287/ijoc.2013.0585

Blondel VD, Guillaume JL, Lambiotte R, Lefebvre E (2008) Fast unfolding of communities in large networks. J Stat Mech Theory Exp 2008(10):P10008. https://doi.org/10.1088/1742-5468/2008/10/P10008

Article   MATH   Google Scholar  

Boon-Itt S, Skunkan Y (2020) Public perception of the COVID-19 pandemic on Twitter: sentiment analysis and topic modeling study. JMIR Public Health Surv 6(4):1978. https://doi.org/10.2196/21978

Boudad N, Faizi R, Thami ROH, Chiheb R (2018) Sentiment analysis in Arabic: a review of the literature. Ain Shams Eng J 9(4):2479–2490. https://doi.org/10.1016/j.asej.2017.04.007

Bouktif S, Fiaz A, Awad M (2020) Augmented textual features-based stock market prediction. IEEE Access 8:40269–40282. https://doi.org/10.1109/ACCESS.2020.2976725

Brito KDS, Filho RLCS, Adeodato PJL (2021) A systematic review of predicting elections based on social media data: research challenges and future directions. IEEE Trans Comput Soc Syst 8(4):819–843. https://doi.org/10.1109/TCSS.2021.3063660

Cai G, Xia B (2015) Convolutional neural networks for multimedia sentiment analysis. In: Natural Language Processing and Chinese Computing, Springer, Cham, p 159–167. https://doi.org/10.1007/978-3-319-25207-0_14

Callon M, Courtial J-P, Turner WA, Bauin S (1983) From translations to problematic networks: an introduction to co-word analysis. Soc Sci Inf 22(2):191–235. https://doi.org/10.1177/053901883022002003

Cambria E, Liu Q, Decherchi S, Xing F, Kwok K (2022a) SenticNet 7: a commonsense-based neurosymbolic AI Framework for Explainable Sentiment Analysis. In: LREC, Marseille: European Language Resources Association (ELRA), pp 3829–3839. https://sentic.net/senticnet-7.pdf

Cambria E, Dragoni M, Kessler B, Donadello I (2022b) Ontosenticnet 2: enhancing reasoning within sentiment analysis. IEEE Intell Syst 37(2):103–110. https://doi.org/10.1109/MIS.2021.3093659

Cambria E, Kumar A, Al-Ayyoub M, Howard N (2022c) Guest editorial: explainable artificial intelligence for sentiment analysis. Knowl Based Syst 238(3):107920. https://doi.org/10.1016/j.knosys.2021.107920

Cambria E, Xing F, Thelwall M, Welsch R (2022d) Sentiment analysis as a multidisciplinary research area. IEEE Trans Artif Intell 3(2):1–4

Google Scholar  

Chan JY-L, Bea KT, Leow SMH, Phoong SW, Cheng WK (2022) State of the art: a review of sentiment analysis based on sequential transfer learning. Artif Intell Rev. https://doi.org/10.1007/s10462-022-10183-8

Chang J-R, Liang H-Y, Chen L-S, Chang C-W (2020) Novel feature selection approaches for improving the performance of sentiment classification. J Ambient Intell Hum Comput. https://doi.org/10.1007/s12652-020-02468-z

Chaturvedi I, Cambria E, Vilares D (2016) Lyapunov filtering of objectivity for Spanish sentiment model. In: Proceedings of the International Joint Conference on Neural Networks (IJCNN), IEEE, pp 4474–4481. https://doi.org/10.1109/IJCNN.2016.7727785

Chen Z, Teng S, Zhang W, Tang H, Zhang Z, He J, et al (2019) LSTM sentiment polarity analysis based on LDA clustering. In: Communications in Computer and Information Science, Springer, Singapore, pp 342–355. https://doi.org/10.1007/978-981-13-3044-5_25

Cheng WK, Bea KT, Leow SMH, Chan JY-L, Hong Z-W, Chen Y-L (2022) A review of sentiment, semantic and event-extraction-based approaches in stock forecasting. Mathematics 10(14):2437. https://doi.org/10.3390/math10142437

Da’u A, Salim N, Rabiu I, Osman A (2020) Recommendation System Exploiting Aspect-Based Opinion Mining with Deep Learning Method. Inf Sci 512:1279–1292. https://doi.org/10.1016/j.ins.2019.10.038

Dangi D, Bhagat A, Dixit DK (2022) Sentiment analysis of social media data based on chaotic coyote optimization algorithm based time weight-adaboost support vector machine approach. Concurr Comput 34(3):6581. https://doi.org/10.1002/cpe.6581

Deng S, Xia S, Hu J, Li H, Liu Y (2021) Exploring the topic structure and evolution of associations in information behavior research through co-word analysis. J Librariansh Inf Sci 53(2):280–297. https://doi.org/10.1177/0961000620938120

Dereli T, Eligüzel N, Çetinkaya C (2021) Content analyses of the international federation of Red Cross and Red Crescent Societies (Ifrc) based on machine learning techniques through Twitter. Nat Hazards 106(3):2025–2045. https://doi.org/10.1007/s11069-021-04527-w

Dey A, Jenamani M, Thakkar JJ (2017) Lexical Tf-Idf: An n-Gram Feature Space for Cross-Domain Classification of Sentiment Reviews. In: International Conference on Pattern Recognition and Machine Intelligence, Springer, Cham, pp 380–386. https://doi.org/10.1007/978-3-319-69900-4_48

Ding Y, Chowdhury GG, Foo S (2001) Bibliometric cartography of information retrieval research by using co-word analysis. Inf Process Manag 37(6):817–842. https://doi.org/10.1016/S0306-4573(00)00051-0

Du C, Sun H, Wang J, Qi Q, Liao J (2020a) Adversarial and domain-aware BERT for cross-domain sentiment analysis. In: Proceedings of the 58th Annual meeting of the association for computational linguistics, association for computational linguistics, p 4019–4028. https://doi.org/10.18653/v1/2020a.acl-main.370

Du Y, He M, Wang L, Zhang H (2020b) Wasserstein based transfer network for cross-domain sentiment classification. Knowl Based Syst 204:6162. https://doi.org/10.1016/j.knosys.2020.106162

Elo S, Kyngäs H (2008) The qualitative content analysis process. J Adv Nurs 62(1):107–115. https://doi.org/10.1111/j.1365-2648.2007.04569.x

Elshakankery K, Ahmed MF (2019) HILATSA: a hybrid incremental learning approach for arabic tweets sentiment analysis. Egypt Inform J 20(3):163–171. https://doi.org/10.1016/j.eij.2019.03.002

Feldman R (2013) Techniques and applications for sentiment analysis. Commun ACM 56(4):82–89. https://doi.org/10.1145/2436256.2436274

Ferilli S, De Carolis B, Esposito F, Redavid D (2015) Sentiment analysis as a text categorization task: a study on feature and algorithm selection for Italian language. In: 2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA), IEEE, pp 1–10. https://doi.org/10.1109/DSAA.2015.7344882

Fink C, Bos N, Perrone A, Liu E, Kopecky J (2013) Twitter, public opinion, and the 2011 Nigerian Presidential Election. In: 2013 International conference on social computing, IEEE, pp 311–320. https://doi.org/10.1109/SocialCom.2013.50

Fitri VA, Andreswari R, Hasibuan MA (2019) Sentiment analysis of social media Twitter with case of anti-LGBT campaign in Indonesia using Naïve Bayes, Decision Tree, and Random Forest Algorithm. Procedia Comput Sci 161:765–772. https://doi.org/10.1016/j.procs.2019.11.181

Garg S (2021) Drug recommendation system based on sentiment analysis of drug reviews using machine learning. In: 2021 11th International conference on cloud computing, data science & engineering (confluence), IEEE, pp 175–181. https://doi.org/10.1109/Confluence51648.2021.9377188

Glorot X, Bordes A, Bengio Y (2011) Domain adaptation for large-scale sentiment classification: a deep learning approach. In: 28th International Conference on Machine Learning, International Machine Learning Society (IMLS), pp 513–520. https://dl.acm.org/doi/ https://doi.org/10.5555/3104482.3104547

Grootendorst M, Warmerdam VD (2021) MaartenGr/KeyBERT (Version 0.5) [Computer program]. https://doi.org/10.5281/ZENODO.5534341 .

Groshek J, Al-Rawi A (2013) Public sentiment and critical framing in social media content during the 2012 US Presidential Campaign. Soc Sci Comput Rev 31(5):563–576. https://doi.org/10.1177/0894439313490401

Habimana O, Li Y, Li R, Gu X, Yu G (2020) Sentiment analysis using deep learning approaches: an overview. Sci China Inf Sci 63(1):1–36. https://doi.org/10.1007/s11432-018-9941-6

Hao Y, Mu T, Hong R, Wang M, Liu X, Goulermas JY (2019) Cross-domain sentiment encoding through stochastic word embedding. IEEE Trans Knowl Data Eng 32(10):1909–1922. https://doi.org/10.1109/TKDE.2019.2913379

Hassan A, Mahmood A (2017) Efficient deep learning model for text classification based on recurrent and convolutional layers. In: 2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA), IEEE, pp 1108–1113. https://doi.org/10.1109/ICMLA.2017.00009

Heaton J (2018). Ian Goodfellow, Yoshua Bengio, and Aaron Courville: Deep Learning. Genetic Programming and Evolvable Machines 19: 305–307. https://doi.org/10.1007/s10710-017-9314-z

Heikal M, Torki M, El-Makky N (2018) Sentiment analysis of Arabic tweets using deep learning. Procedia Comput Sci 142:114–122. https://doi.org/10.1016/j.procs.2018.10.466

Huang B, Ou Y, Carley KM (2018) Aspect level sentiment classification with attention-over-attention neural networks. In: International Conference on Social Computing, Behavioral-Cultural Modeling and Prediction and Behavior Representation in Modeling and Simulation, Springer, Cham, pp 197–206. https://doi.org/10.1007/978-3-319-93372-6_22

Hussain N, Mirza HT, Rasool G, Hussain I, Kaleem M (2019) Spam review detection techniques: a systematic literature review. Appl Sci 9(5):987. https://doi.org/10.3390/app9050987

Hussein DMEDM (2018) A survey on sentiment analysis challenges. J King Saud Univ 30(4):330–338. https://doi.org/10.1016/j.jksues.2016.04.002

Ikram MT, Afzal MT (2019) Aspect based citation sentiment analysis using linguistic patterns for better comprehension of scientific knowledge. Scientometrics 119(1):73–95. https://doi.org/10.1007/s11192-019-03028-9

Injadat MN, Salo F, Nassif AB (2016) Data mining techniques in social media: a survey. Neurocomputing 214:654–670. https://doi.org/10.1016/j.neucom.2016.06.045

Isah H, Trundle P, Neagu D (2014) Social media analysis for product safety using text mining and sentiment analysis. In: 2014 14th UK Workshop on Computational Intelligence (UKCI), IEEE, pp 1–7. https://doi.org/10.1109/UKCI.2014.6930158

ISO 639-3 (2017) Registration Authority. https://iso639-3.sil.org/

Jain DK, Boyapati P, Venkatesh J, Prakash M (2022) An intelligent cognitive-inspired computing with big data analytics framework for sentiment analysis and classification. Inf Process Manag 59(1):2758. https://doi.org/10.1016/j.ipm.2021.102758

Januário BA, de Carosia AEO, da Silva AEA, Coelho GP (2022) Sentiment analysis applied to news from the Brazilian stock market. IEEE Latin Am Trans 20(3):512–518. https://doi.org/10.1109/TLA.2022.9667151

JayaLakshmi ANM, Kishore KVK (2022) Performance evaluation of DNN with other machine learning techniques in a cluster using apache spark and MLlib. J King Saud Univ 34(1):1311–1319. https://doi.org/10.1016/j.jksuci.2018.09.022

Jia X, Wang L (2022) Attention enhanced capsule network for text classification by encoding syntactic dependency trees with graph convolutional neural network. PeerJ Comput Sci 7:e831. https://doi.org/10.7717/PEERJ-CS.831

Jiang D, Luo X, Xuan J, Xu Z (2017) Sentiment computing for the news event based on the social media big data. IEEE Access 5:2373–2382. https://doi.org/10.1109/ACCESS.2016.2607218

Kaity M, Balakrishnan V (2020) Sentiment Lexicons and non-English languages: a survey. Knowl Inf Syst 62(12):4445–4480. https://doi.org/10.1007/s10115-020-01497-6

Kastrati Z, Dalipi F, Imran AS, Nuci KP, Wani MA (2021) Sentiment analysis of students’ feedback with Nlp and deep learning: a systematic mapping study. Appl Sci 11(9):3986. https://doi.org/10.3390/app11093986

Kaur H, Mangat V, Nidhi (2017) A survey of sentiment analysis techniques. In: 2017 International Conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud)(I-SMAC), IEEE, pp 921–925. https://doi.org/10.1109/I-SMAC.2017.8058315

Khan K, Baharudin BB, Khan A (2009) Mining opinion from text documents: a survey. In: 2009 3rd IEEE International conference on digital ecosystems and technologies, IEEE, pp 217–222. https://doi.org/10.4304/jetwi.5.4.343-353

Khasawneh RT, Wahsheh HA, Al-Kabi MN, Alsmadi IM (2013) Sentiment analysis of Arabic social media content: a comparative study. In: 8th International Conference for Internet Technology and Secured Transactions (ICITST-2013), IEEE, pp 101–106. https://doi.org/10.1109/ICITST.2013.6750171

Khattak A, Asghar MZ, Saeed A, Hameed IA, Asif Hassan S, Ahmad S (2021) A survey on sentiment analysis in Urdu: a resource-poor language. Egypt Inform J 22(1):53–74. https://doi.org/10.1016/j.eij.2020.04.003

Khatua A, Khatua A, Cambria E (2020) Predicting political sentiments of voters from Twitter in multi-party contexts. Appl Soft Comput J 97:106743. https://doi.org/10.1016/j.asoc.2020.106743

Kitchenham B (2004) Procedures for performing systematic reviews, version 1.0. Empir Softw Eng 33(2004):1–26

Kitchenham B, Charters SM (2007) Guidelines for performing systematic literature reviews in software engineering. Tech Rep 5:1–57

Korayem M, Crandall D, Abdul-Mageed M (2012) Subjectivity and sentiment analysis of Arabic: a survey. In: International conference on advanced machine learning technologies and applications, Springer, Berlin, Heidelberg, p 128–139. https://doi.org/10.1007/978-3-642-35326-0_14

Koto F, Adriani M (2015) A comparative study on Twitter sentiment analysis: Which Features Are Good? In: International conference on applications of natural language to information systems, Springer, Cham, p 453–457. https://doi.org/10.1007/978-3-319-19581-0_46

Krippendorff K (2018) Content analysis: an introduction to its methodology. Sage publications.

Kumar A, Garg G (2020) Systematic literature review on context-based sentiment analysis in social multimedia. Multimed Tools Appl 79(21):15349–15380. https://doi.org/10.1007/s11042-019-7346-5

Kumar A, Jaiswal A (2020) Systematic literature review of sentiment analysis on twitter using soft computing techniques. Concurr Comput 32(1):e5107. https://doi.org/10.1002/cpe.5107

Article   MathSciNet   Google Scholar  

Kumar A, Sebastian TM (2012) Sentiment analysis: a perspective on its past, present and future. Int J Intell Syst Appl 4(10):1–14. https://doi.org/10.5815/ijisa.2012.10.01

Kumar A, Narapareddy VT, Gupta P, Srikanth VA, Neti LB, Malapati A (2021) Adversarial and auxiliary features-aware BERT for sarcasm detection. In: 8th ACM IKDD CODS and 26th COMAD, association for computing machinery, p 163–170. https://doi.org/10.1145/3430984.3431024

Kydros D, Argyropoulou M, Vrana V (2021) A content and sentiment analysis of Greek tweets during the pandemic. Sustainability (switzerland) 13(11):6150. https://doi.org/10.3390/su13116150

Lai Y, Zhang L, Han D, Zhou R, Wang G (2020) Fine-grained emotion classification of chinese microblogs based on graph convolution networks. World Wide Web 23(5):2771–2787. https://doi.org/10.1007/s11280-020-00803-0

Leiden University's Centre for Science and Technology Studies (CWTS) (2021) VOSviewer (Version 1.6.17)[Software]. https://www.vosviewer.com/

Leydesdorff L, Park HW, Wagner C (2014) International co-authorship relations in the social science citation index: is internationalization leading the network? J Assoc Inf Sci Technol 65(10):2111–2126. https://doi.org/10.48550/arXiv.1305.4242

Li D, Qian J (2016) Text sentiment analysis based on long short-term memory. In: 2016 First IEEE International Conference on Computer Communication and the Internet (ICCCI), IEEE, pp 471–475. https://doi.org/10.1109/CCI.2016.7778967

Li F, Huang M, Zhu X (2010) Sentiment analysis with global topics and local dependency. In: Proceedings of the AAAI Conference on Artificial Intelligence, Atlanta, Georgia, USA: AAAI Press, Palo Alto, California USA, pp 1371–1376. https://doi.org/10.1609/aaai.v24i1.7523

Li J, Sun M (2007) Experimental study on sentiment classification of chinese review using machine learning techniques. In: 2007 International Conference on Natural Language Processing and Knowledge Engineering, IEEE, pp 393–400. https://doi.org/10.1109/NLPKE.2007.4368061

Li N, Liang X, Li X, Wang C, Wu DD (2009) Network environment and financial risk using machine learning and sentiment analysis. Hum Ecol Risk Assess 15(2):227–252. https://doi.org/10.1080/10807030902761056

Li W, Zhu L, Shi Y, Guo K, Cambria E (2020) User reviews: sentiment analysis using Lexicon integrated two-channel CNN–LSTM family models. Appl Soft Comput J 94:6435. https://doi.org/10.1016/j.asoc.2020.106435

Li W, Shao W, Ji S, Cambria E (2022) BiERU: bidirectional emotional recurrent unit for conversational sentiment analysis. Neurocomputing 467:73–82. https://doi.org/10.1016/j.neucom.2021.09.057

Li Y, Pan Q, Yang T, Wang S, Tang J, Cambria E (2017) Learning word representations for sentiment analysis. Cogn Comput 9(6):843–851. https://doi.org/10.1007/s12559-017-9492-2

Liang B, Su H, Gui L, Cambria E, Xu R (2022) Aspect-based sentiment analysis via affective knowledge enhanced graph convolutional networks. Knowl Based Syst 235:107643. https://doi.org/10.1016/j.knosys.2021.107643

Ligthart A, Catal C, Tekinerdogan B (2021) Systematic reviews in sentiment analysis: a tertiary study. Artif Intell Rev 54(7):4997–5053. https://doi.org/10.1007/s10462-021-09973-3

Lin B, Cassee N, Serebrenik A, Bavota G, Novielli N, Lanza M (2022) Opinion mining for software development: a systematic literature review. ACM Trans Softw Eng Methodol 31(3):1–41. https://doi.org/10.1145/3490388

Lin Y, Li J, Yang L, Xu K, Lin H (2020) Sentiment analysis with comparison enhanced deep neural network. IEEE Access 8:78378–78384. https://doi.org/10.1109/ACCESS.2020.2989424

Liu F, Zheng J, Zheng L, Chen C (2020a) Combining attention-based bidirectional gated recurrent neural network and two-dimensional convolutional neural network for document-level sentiment classification. Neurocomputing 371:39–50. https://doi.org/10.1016/j.neucom.2019.09.012

Liu L, Nie X, Wang H (2012) Toward a fuzzy domain sentiment ontology tree for sentiment analysis. In: 2012 5th International congress on image and signal processing, IEEE, pp 1620–1624. https://doi.org/10.1109/CISP.2012.6469930

Liu R, Shi Y, Ji C, Jia M (2019) A survey of sentiment analysis based on transfer learning. IEEE Access 7:85401–85412. https://doi.org/10.1109/ACCESS.2019.2925059

Liu S, Lee K, Lee I (2020b) Document-level multi-topic sentiment classification of email data with BiLSTM and data augmentation. Knowl Based Syst 197:105918. https://doi.org/10.1016/j.knosys.2020.105918

Liu SM, Chen JH (2015) A multi-label classification based approach for sentiment classification. Expert Syst Appl 42(3):1083–1093. https://doi.org/10.1016/j.eswa.2014.08.036

Liu X, Zeng D, Li J, Wang F-Y, Zuo W (2009) Sentiment analysis of Chinese documents: from sentence to document level. J Am Soc Inform Sci Technol 60(12):2474–2487. https://doi.org/10.1002/asi.21206

Lo YW, Potdar V (2009) A review of opinion mining and sentiment classification framework in social networks. In: 2009 3rd IEEE International conference on digital ecosystems and technologies, IEEE, pp 396–401. https://doi.org/10.1109/DEST.2009.5276705

Lulu L, Elnagar A (2018) Automatic arabic dialect classification using deep learning models. Procedia Comput Sci 142:262–269. https://doi.org/10.1016/j.procs.2018.10.489

Ma Y, Peng H, Cambria E (2018) Targeted aspect-based sentiment analysis via embedding commonsense knowledge into an attentive LSTM. In: 32nd AAAI conference on artificial intelligence, New Orleans, Louisiana, USA: AAAI Press, Palo Alto, California USA, pp 5876–5883. https://doi.org/10.1609/aaai.v32i1.12048

Malandri L, Porcel C, Xing F, Serrano-Guerrero J, Cambria E (2022) Soft computing for recommender systems and sentiment analysis. Appl Soft Comput. https://doi.org/10.1016/j.asoc.2021.108246

Mäntylä MV, Graziotin D, Kuutila M (2018) The evolution of sentiment analysis-a review of research topics, venues and top cited papers. Comput Sci Rev 27:16–32. https://doi.org/10.1016/j.cosrev.2017.10.002

Mao Y, Zhang Y, Jiao L, Zhang H (2022) Document-level sentiment analysis using attention-based bi-directional long short-term memory network and two-dimensional convolutional neural network. Electronics 11(12):1906. https://doi.org/10.3390/electronics11121906

Maqsood H, Mehmood I, Maqsood M, Yasir M, Afzal S, Aadil F et al (2020) A local and global event sentiment based efficient stock exchange forecasting using deep learning. Int J Inf Manag 50:432–451. https://doi.org/10.1016/j.ijinfomgt.2019.07.011

Martinez-Camara E, Martin-Valdivia MT, Urena-Lopez LA (2011) Opinion classification techniques applied to a Spanish Corpus. In: International conference on application of natural language to information systems, Springer, Berlin, Heidelberg, pp 169–176. https://doi.org/10.1007/978-3-642-22327-3_17

Martinez-Garcia A, Badia T, Barnes J (2021) Evaluating morphological typology in zero-shot cross-lingual transfer. In: Proceedings of the 59th annual meeting of the association for computational linguistics and the 11th international joint conference on natural language processing, association for computational linguistics, pp 3136–3153. https://doi.org/10.18653/v1/2021.acl-long.244

Medhat W, Hassan A, Korashy H (2014) Sentiment analysis algorithms and applications: a survey. Ain Shams Eng J 5(4):1093–1113. https://doi.org/10.1016/j.asej.2014.04.011

Momtazi S (2012) Fine-grained German sentiment analysis on social media. In: Proceedings of the 8th International conference on language resources and evaluation (LREC’12), European Language Resources Association (ELRA), pp 1215–1220. http://www.lrec-conf.org/proceedings/lrec2012/pdf/999_Paper.pdf

Myslin M, Zhu SH, Chapman W, Conway M (2013) Using Twitter to examine smoking behavior and perceptions of emerging tobacco products. J Med Int Res 15(8):174. https://doi.org/10.2196/jmir.2534

Nair RR, Mathew J, Muraleedharan V, Deepa Kanmani S (2019) Study of machine learning techniques for sentiment analysis. In: 2019 3rd International Conference on Computing Methodologies and Communication (ICCMC), IEEE, pp 978–984. https://doi.org/10.1109/ICCMC.2019.8819763

Nassif AB, Elnagar A, Shahin I, Henno S (2021) Deep learning for Arabic subjective sentiment analysis: challenges and research opportunities. Appl Soft Comput 98:6836. https://doi.org/10.1016/j.asoc.2020.106836

Nassirtoussi AK, Aghabozorgi S, Wah TY, Ngo DCL (2014) Text mining for market prediction: a systematic review. Expert Syst Appl 41(16):7653–7670. https://doi.org/10.1016/j.eswa.2014.06.009

Nejat B, Carenini G, Ng R (2017) Exploring joint neural model for sentence level discourse parsing and sentiment analysis. In: Proceedings of the 18th annual sigdial meeting on discourse and dialogue, association for computational linguistics, pp 289–298. https://doi.org/10.18653/v1/w17-5535

Nhlabano VV, Lutu PEN (2018). Impact of text pre-processing on the performance of sentiment analysis models for social media data. In: 2018 International Conference on Advances in Big Data, Computing and Data Communication Systems (IcABCD), IEEE, pp 1–6. https://doi.org/10.1109/ICABCD.2018.8465135

Nicholls C, Song F (2010) Comparison of feature selection methods for sentiment analysis. In: Canadian conference on artificial intelligence, Springer, Berlin, Heidelberg, pp 286–289. https://doi.org/10.1007/978-3-319-96292-4_21

Nielsen FA (2011) A New ANEW: Evaluation of a Word List for Sentiment Analysis in Microblogs. In: Proceedings of the ESWC2011 workshop on “Making Sense of Microposts”: big things come in small packages, Heraklion, Crete, Greece: CEUR-WS, Aachen, pp 93–98. https://doi.org/10.48550/arXiv.1103.2903

Obiedat R, Al-Darras D, Alzaghoul E, Harfoushi O (2021) Arabic aspect-based sentiment analysis: a systematic literature review. IEEE Access 9:152628–152645. https://doi.org/10.1109/ACCESS.2021.3127140

Ombabi AH, Ouarda W, Alimi AM (2020) Deep learning CNN–LSTM framework for Arabic sentiment analysis using textual information shared in social networks. Soc Netw Anal Min 10(1):1–13. https://doi.org/10.1007/s13278-020-00668-1

Oueslati O, Cambria E, Ben HM, Ounelli H (2020) A review of sentiment analysis research in Arabic language. Futur Gener Comput Syst 112:408–430. https://doi.org/10.1016/j.future.2020.05.034

Ouyang X, Zhou P, Li CH, Liu L (2015) Sentiment Analysis Using Convolutional Neural Network. In: 2015 IEEE International conference on computer and information technology; ubiquitous computing and communications; dependable, autonomic and secure computing; pervasive intelligence and computing, IEEE, p 2359–2364. https://doi.org/10.1109/CIT/IUCC/DASC/PICOM.2015.349

Pecore S, Villaneau J (2019) Complex and Precise Movie and Book Annotations in French Language for Aspect Based Sentiment Analysis. In: LREC 2018—11th International conference on language resources and evaluation, European Language Resources Association (ELRA), p 2647–2652. https://aclanthology.org/L18-1419

Peng H, Cambria E, Hussain A (2017) A review of sentiment analysis research in Chinese language. Cogn Comput 9(4):423–435. https://doi.org/10.1007/s12559-017-9470-8

Peng H, Ma Y, Li Y, Cambria E (2018) Learning multi-grained aspect target sequence for Chinese sentiment analysis. Knowl Based Syst 148:167–176. https://doi.org/10.1016/j.knosys.2018.02.034

Pereira DA (2021) A survey of sentiment analysis in the Portuguese language. Artif Intell Rev 54(2):1087–1115. https://doi.org/10.1007/s10462-020-09870-1

Perianes-Rodriguez A, Waltman L, van Eck NJ (2016) Constructing bibliometric networks: a comparison between full and fractional counting. J Informetr 10(4):1178–1195. https://doi.org/10.1016/j.joi.2016.10.006

Persson O (2017) BibExcel [Software]. Available from https://homepage.univie.ac.at/juan.gorraiz/bibexcel/

Persson O, Danell R, Schneider JW (2009) How to Use Bibexcel for Various Types of Bibliometric Analysis. In: Celebrating scholarly communication studies: a festschrift for Olle Persson at his 60th birthday, ed. J. Schneider F. Åström, R. Danell, B. Larsen. Leuven, Belgium: International Society for Scientometrics and Informetrics, pp 9–24

Picasso A, Merello S, Ma Y, Oneto L, Cambria E (2019) Technical analysis and sentiment embeddings for market trend prediction. Expert Syst Appl 135:60–70. https://doi.org/10.1016/j.eswa.2019.06.014

Piryani R, Madhavi D, Singh VK (2017) Analytical mapping of opinion mining and sentiment analysis research during 2000–2015. Inf Process Manag 53(1):122–150. https://doi.org/10.1016/j.ipm.2016.07.001

Piryani R, Piryani B, Singh VK, Pinto D (2020) Sentiment analysis in Nepali: exploring machine learning and lexicon-based approaches. J Intell Fuzzy Syst 39(2):2201–2212. https://doi.org/10.3233/JIFS-179884

Plaza-del-Arco FM, Martín-Valdivia MT, Ureña-López LA, Mitkov R (2020) Improved emotion recognition in spanish social media through incorporation of lexical knowledge. Futur Gener Comput Syst 110:1000–1008. https://doi.org/10.1016/j.future.2019.09.034

Poria S, Cambria E, Gelbukh A (2016) Aspect extraction for opinion mining with a deep convolutional neural network. Knowl Based Syst 108:42–49. https://doi.org/10.1016/j.knosys.2016.06.009

Prabha MI, Srikanth GU (2019). Survey of Sentiment Analysis Using Deep Learning Techniques. In: 2019 1st International Conference on Innovations in Information and Communication Technology (ICIICT), IEEE, p 1–9. https://doi.org/10.1109/ICIICT1.2019.8741438

Prabhat A, Khullar V (2017). Sentiment Classification on Big Data Using Naïve Bayes and Logistic Regression. In: 2017 International Conference on Computer Communication and Informatics (ICCCI), IEEE, p 1–5. https://doi.org/10.1109/ICCCI.2017.8117734

Preethi PG, Uma V, Kumar A (2015) Temporal sentiment analysis and causal rules extraction from Tweets for event prediction. Procedia Comput Sci 48:84–89. https://doi.org/10.1016/j.procs.2015.04.154

Qasem M, Thulasiram R, Thulasiram P (2015) Twitter Sentiment Classification Using Machine Learning Techniques for Stock Markets. In: 2015 International Conference on Advances in Computing, Communications and Informatics (ICACCI), IEEE, p 834–840. https://doi.org/10.1109/ICACCI.2015.7275714

Qazi A, Fayaz H, Wadi A, Raj RG, Rahim NA, Khan WA (2015) The artificial neural network for solar radiation prediction and designing solar systems: a systematic literature review. J Clean Prod 104:1–12. https://doi.org/10.1016/j.jclepro.2015.04.041

Qazi A, Raj RG, Hardaker G, Standing C (2017) A systematic literature review on opinion types and sentiment analysis techniques: tasks and challenges. Internet Res 27(3):608–630. https://doi.org/10.1108/IntR-04-2016-0086

Raghuvanshi N, Patil JM (2016) A Brief Review on Sentiment Analysis. In: 2016 International Conference on Electrical, Electronics, and Optimization Techniques (ICEEOT), IEEE, p 2827–2831. https://doi.org/10.1109/ICEEOT.2016.7755213

Rambocas M, Pacheco BG (2018) Online sentiment analysis in marketing research: a review. J Res Interact Mark 12(2):146–163. https://doi.org/10.1108/JRIM-05-2017-0030

Rao G, Gu X, Feng Z, Cong Q, Zhang L (2021) A Novel Joint Model with Second-Order Features and Matching Attention for Aspect-Based Sentiment Analysis. In: 2021 International Joint Conference on Neural Networks (IJCNN), IEEE, p 1–8. https://doi.org/10.1109/IJCNN52387.2021.9534321

Ravi K, Ravi V (2015) A survey on opinion mining and sentiment analysis: tasks, approaches and applications. Knowl Based Syst 89:14–46. https://doi.org/10.1016/j.knosys.2015.06.015

Rotta R, Noack A (2011) Multilevel local search algorithms for modularity clustering. ACM J Exp Algorithmics 16(2):1–27. https://doi.org/10.1145/1963190.1970376

Article   MATH   MathSciNet   Google Scholar  

Sadamitsu K, Sekine S, Yamamoto M (2008) Sentiment Analysis Based on Probabilistic Models Using Inter-Sentence Information. In: Proceedings of the sixth international conference on language resources and evaluation (LREC’08), European Language Resources Association (ELRA), p 2892–2896. http://www.lrec-conf.org/proceedings/lrec2008/pdf/736_paper.pdf

Salur MU, Aydin I (2020) A novel hybrid deep learning model for sentiment classification. IEEE Access 8:58080–58093. https://doi.org/10.1109/ACCESS.2020.2982538

Sánchez-Rada JF, Iglesias CA (2019) Social context in sentiment analysis: formal definition, overview of current trends and framework for comparison. Inf Fusion 52:344–356. https://doi.org/10.1016/j.inffus.2019.05.003

Santos R, Costa AA, Silvestre JD, Pyl L (2019) Informetric analysis and review of literature on the role of BIM in sustainable construction. Autom Constr 103:221–234. https://doi.org/10.1016/j.autcon.2019.02.022

Sari IC, Ruldeviyani Y (2020) Sentiment Analysis of the Covid-19 Virus Infection in Indonesian Public Transportation on Twitter Data: A Case Study of Commuter Line Passengers. In: 2020 International Workshop on Big Data and Information Security (IWBIS), IEEE, pp 23–28. https://doi.org/10.1109/IWBIS50925.2020.9255531

Sarsam SM, Al-Samarraie H, Alzahrani AI, Wright B (2020) Sarcasm detection using machine learning algorithms in Twitter: a systematic review. Int J Mark Res 62(5):578–598. https://doi.org/10.1177/1470785320921779

Sayed AA, Elgeldawi E, Zaki AM, Galal AR (2020) Sentiment Analysis for Arabic Reviews Using Machine Learning Classification Algorithms. In: 2020 International Conference on Innovative Trends in Communication and Computer Engineering (ITCE), IEEE, p 56–63. https://doi.org/10.1109/ITCE48509.2020.9047822

Schouten K, Frasincar F (2015) Survey on aspect-level sentiment analysis. IEEE Trans Knowl Data Eng 28(3):813–830. https://doi.org/10.1109/TKDE.2015.2485209

Schuller B, Mousa AED, Vryniotis V (2015) Sentiment analysis and opinion mining: on optimal parameters and performances. Wiley Interdiscip Rev 5(5):255–263. https://doi.org/10.1002/widm.1159

Serrano-Guerrero J, Romero FP, Olivas JA (2021) Fuzzy logic applied to opinion mining: a review. Knowl Based Syst 222:107018. https://doi.org/10.1016/j.knosys.2021.107018

Sharma S, Jain A (2020) Role of sentiment analysis in social media security and analytics. Wiley Interdiscip Rev 10(5):e1366. https://doi.org/10.1002/widm.1366

Shirsat VS, Jagdale RS, Deshmukh SN (2018) Document Level Sentiment Analysis from News Articles. In: 2017 International Conference on Computing, Communication, Control and Automation (ICCUBEA), IEEE, pp 1–4. https://doi.org/10.1109/ICCUBEA.2017.8463638

Shofiya C, Abidi S (2021) Sentiment analysis on Covid-19-related social distancing in Canada using Twitter data. Int J Environ Res Public Health 18(11):5993. https://doi.org/10.3390/ijerph18115993

Singh RK, Sachan MK, Patel RB (2021) 360 Degree view of cross-domain opinion classification: a survey. Artif Intell Rev 54(2):1385–1506. https://doi.org/10.1007/s10462-020-09884-9

Singh T, Kumari M (2016) Role of text pre-processing in twitter sentiment analysis. Procedia Comput Sci 89:549–554. https://doi.org/10.1016/j.procs.2016.06.095

Smailović J, Grčar M, Lavrač N, Žnidaršič M (2014) Stream-based active learning for sentiment analysis in the financial domain. Inf Sci 285(1):181–203. https://doi.org/10.1016/j.ins.2014.04.034

Smetanin S (2020) The applications of sentiment analysis for Russian language texts: current challenges and future perspectives. IEEE Access 8:110693–110719. https://doi.org/10.1109/ACCESS.2020.3002215

Stemler S (2000) An overview of content analysis. Pract Assess Res Eval 7(1):1–16. https://doi.org/10.1362/146934703771910080

Sutoyo E, Rifai AP, Risnumawan A, Saputra M (2022) A comparison of text weighting schemes on sentiment analysis of government policies: a case study of replacement of national examinations. Multimed Tools Appl 81(5):6413–6431. https://doi.org/10.1007/s11042-022-11900-9

Syed AZ, Aslam M, Martinez-Enriquez AM (2010) Lexicon Based Sentiment Analysis of Urdu Text Using SentiUnits. In: Mexican international conference on artificial intelligence, Springer, Berlin, Heidelberg, pp 32–43. https://doi.org/10.1007/978-3-642-16761-4_4

Taboada M (2016) Sentiment analysis: an overview from linguistics. Annu Rev Linguist 2:325–347. https://doi.org/10.1146/annurev-linguistics-011415-040518

Tai KS, Socher R, Manning CD (2015) Improved Semantic Representations from Tree-Structured Long Short-Term Memory Networks. In: Proceedings of the 53rd Annual meeting of the association for computational linguistics and the 7th international joint conference on natural language processing, association for computational linguistics, pp 1556–1566. https://doi.org/10.3115/v1/p15-1150

Tammina S (2020) A Hybrid Learning Approach for Sentiment Classification in Telugu Language. In: 2020 International conference on Artificial Intelligence and Signal Processing (AISP), IEEE, p 1–6. https://doi.org/10.1109/AISP48273.2020.9073109

Tan S, Cheng X, Wang Y, Xu H (2009) Adapting Naive Bayes to Domain Adaptation for Sentiment Analysis. In: European Conference on Information Retrieval, Springer, Berlin, Heidelberg, p 337–349. https://doi.org/10.1007/978-3-642-00958-7_31

Tan X, Cai Y, Xu J, Leung H-F, Chen W, Li Q (2020) Improving aspect-based sentiment analysis via aligning aspect embedding. Neurocomputing 383:336–347. https://doi.org/10.1016/j.neucom.2019.12.035

Tembhurne JV, Diwan T (2021) Sentiment analysis in textual, visual and multimodal inputs using recurrent neural networks. Multimed Tools Appl 80(5):6871–6910. https://doi.org/10.1007/s11042-020-10037-x

Thakur RK, Deshpande MV (2019) Kernel optimized-support vector machine and mapreduce framework for sentiment classification of train reviews. Int J Uncertain Fuzziness Knowl Based Syst 27(6):1025–1050. https://doi.org/10.1142/S0218488519500454

Thelwall M, Buckley K, Paltoglou G (2012) Sentiment strength detection for the social web. J Am Soc Inform Sci Technol 63(1):163–173. https://doi.org/10.1002/asi.21662

Thet TT, Na JC, Khoo CSG (2010) Aspect-based sentiment analysis of movie reviews on discussion boards. J Inf Sci 36(6):823–848. https://doi.org/10.1177/0165551510388123

Trilla A, Alías F (2009) Sentiment Classification in English from Sentence-Level Annotations of Emotions Regarding Models of Affect. In: 10th Annual Conference of the International Speech Communication Association, International Speech Communication Association (ISCA), p 516–519. https://doi.org/10.21437/interspeech.2009-189

Trisna KW, Jie HJ (2022) Deep learning approach for aspect-based sentiment classification: a comparative review. Appl Artif Intell. https://doi.org/10.1080/08839514.2021.2014186

Valverde-Albacete FJ, Carrillo-de-Albornoz J, Peláez-Moreno C (2013) A Proposal for New Evaluation Metrics and Result Visualization Technique for Sentiment Analysis Tasks. In: International conference of the cross-language evaluation forum for European languages, Springer, Berlin, Heidelberg, p 41–52. https://doi.org/10.1007/978-3-642-40802-1_5

Van Eck NJ, Waltman L (2010) Software survey: VOSviewer, a computer program for bibliometric mapping. Scientometrics 84(2):523–538. https://doi.org/10.1007/s11192-009-0146-3

Verma S (2022) Sentiment analysis of public services for smart society: literature review and future research directions. Gov Inf Quart 39(3):101708. https://doi.org/10.1016/j.giq.2022.101708

Waila P, Marisha S, Singh VK, Singh MK (2012) Evaluating Machine Learning and Unsupervised Semantic Orientation Approaches for Sentiment Analysis of Textual Reviews. In: 2012 IEEE International conference on computational intelligence and computing research, IEEE, pp 1–6. https://doi.org/10.1109/ICCIC.2012.6510235

Waltman L, Van Eck NJ (2013) A smart local moving algorithm for large-scale modularity-based community detection. Eur Phys J B 86(11):1–33. https://doi.org/10.1140/epjb/e2013-40829-0

Waltman L, Van Eck NJ, Noyons ECM (2010) A unified approach to mapping and clustering of bibliometric networks. J Inform 4(4):629–635. https://doi.org/10.1016/j.joi.2010.07.002

Wang C, Yang X, Ding L (2021) Deep learning sentiment classification based on weak tagging information. IEEE Access 9:66509–66518. https://doi.org/10.1109/ACCESS.2021.3077059

Wang L, Wan Y (2011) Sentiment Classification of Documents Based on Latent Semantic Analysis. In: International conference on computer education, simulation and modeling, Springer, Berlin, Heidelberg, p 356–361. https://doi.org/10.1007/978-3-642-21802-6_57

Wang T, Lu K, Chow KP, Zhu Q (2020a) COVID-19 sensing: negative sentiment analysis on social media in China via BERT model. IEEE Access 8:138162–138169. https://doi.org/10.1109/ACCESS.2020.3012595

Wang Z, Chong CS, Lan L, Yang Y, Ho S-B, Tong JC (2016) Fine-Grained Sentiment Analysis of Social Media with Emotion Sensing. In: 2016 Future Technologies Conference (FTC), IEEE, pp 1361–1364. https://doi.org/10.1109/FTC.2016.7821783

Wang Z, Ho S-B, Cambria E (2020b) A review of emotion sensing: categorization models and algorithms. Multimed Tools Appl 79(47):35553–35582. https://doi.org/10.1007/s11042-019-08328-z

Wang Z, Ho S-B, Cambria E (2020c) Multi-level fine-scaled sentiment sensing with ambivalence handling. Int J Uncertain Fuzziness Knowl-Based Syst 28(4):683–697. https://doi.org/10.1142/S0218488520500294

Wang Z, Lin Z (2020) Optimal feature selection for learning-based algorithms for sentiment classification. Cogn Comput 12(1):238–248. https://doi.org/10.1007/s12559-019-09669-5

Wang Z, Tong VJC, Chan D (2014) Issues of Social Data Analytics with a New Method for Sentiment Analysis of Social Media Data. In: 2014 IEEE 6th International conference on cloud computing technology and science, IEEE, pp 899–904. https://doi.org/10.1109/CloudCom.2014.40

Wang ZY, Li G, Li CY, Li A (2012) Research on the semantic-based co-word analysis. Scientometrics 90(3):855–875. https://doi.org/10.1007/s11192-011-0563-y

Wankhade M, Rao ACS, Kulkarni C (2022) A survey on sentiment analysis methods, applications, and challenges. Artif Intell Rev 55:5731–5780. https://doi.org/10.1007/s10462-022-10144-1

Xing FZ, Cambria E, Welsch RE (2018) Natural language based financial forecasting: a survey. Artif Intell Rev 50(1):49–73. https://doi.org/10.1007/s10462-017-9588-9

Xing FZ, Pallucchini F, Cambria E (2019) Cognitive-inspired domain adaptation of sentiment lexicons. Inf Process Manage 56(3):554–564. https://doi.org/10.1016/j.ipm.2018.11.002

Xiong Z, Qin K, Yang H, Luo G (2021) Learning Chinese word representation better by cascade morphological N-Gram. Neural Comput Appl 33(8):3757–3768. https://doi.org/10.1007/s00521-020-05198-7

Yang B, Shao B, Wu L, Lin X (2022) Multimodal sentiment analysis with unidirectional modality translation. Neurocomputing 467:130–137. https://doi.org/10.1016/j.neucom.2021.09.041

Yang L, Li Y, Wang J, Sherratt RS (2020a) Sentiment analysis for E-commerce product reviews in Chinese based on sentiment lexicon and deep learning. IEEE Access 8:23522–23530. https://doi.org/10.1109/ACCESS.2020.2969854

Yang M, Qu Q, Shen Y, Lei K, Zhu J (2020b) Cross-domain aspect/sentiment-aware abstractive review summarization by combining topic modeling and deep reinforcement learning. Neural Comput Appl 32(11):6421–6433. https://doi.org/10.1007/s00521-018-3825-2

Yi J, Niblack W (2005) Sentiment Mining in WebFountain. In: 21st International Conference on Data Engineering (ICDE’05), IEEE, p 1073–1083. https://doi.org/10.1109/ICDE.2005.132

Yin H, Yang S, Li J (2020) Detecting Topic and Sentiment Dynamics Due to COVID-19 Pandemic Using Social Media. In: International conference on advanced data mining and applications, Springer, Cham, p 610–623. https://doi.org/10.1007/978-3-030-65390-3_46

You L, Li Y, Wang Y, Zhang J, Yang Y (2016) A deep learning-based RNNs model for automatic security audit of short messages. In: 2016 16th International Symposium on Communications and Information Technologies (ISCIT), IEEE, p 225–229. https://doi.org/10.1109/ISCIT.2016.7751626

You T, Yoon J, Kwon O-H, Jung W-S (2021) Tracing the evolution of physics with a keyword co-occurrence network. J Korean Phys Soc 78(3):236–243. https://doi.org/10.1007/s40042-020-00051-5

Yu J, Jiang J, Xia R (2019) Entity-sensitive attention and fusion network for entity-level multimodal sentiment classification. IEEE/ACM Trans Audio Speech Lang Process 28:429–439. https://doi.org/10.1109/TASLP.2019.2957872

Yuan JH, Wu Y, Lu X, Zhao YY, Qin B, Liu T (2020) Recent advances in deep learning based sentiment analysis. Sci China Technol Sci 63(10):1947–1970. https://doi.org/10.1007/s11431-020-1634-3

Yue L, Chen W, Li X, Zuo W, Yin M (2019) A survey of sentiment analysis in social media. Knowl Inf Syst 60(2):617–663. https://doi.org/10.1007/s10115-018-1236-4

Yurtalan G, Koyuncu M, Turhan Ç (2019) A polarity calculation approach for lexicon-based Turkish sentiment analysis. Turk J Electr Eng Comput Sci 27(2):1325–1339. https://doi.org/10.3906/elk-1803-92

Zhang L, Wang S, Liu B (2018) Deep learning for sentiment analysis: a survey. Wiley Interdiscip Rev 8(4):e1253. https://doi.org/10.1002/widm.1253

Zhang Y, Du J, Ma X, Wen H, Fortino G (2021) Aspect-based sentiment analysis for user reviews. Cogn Comput 13(5):1114–1127. https://doi.org/10.1007/s12559-021-09855-4

Zhang Y, Zhang Z, Miao D, Wang J (2019) Three-way enhanced convolutional neural networks for sentence-level sentiment classification. Inf Sci 477:55–64. https://doi.org/10.1016/j.ins.2018.10.030

Zhao N, Gao H, Wen X, Li H (2021) Combination of convolutional neural network and gated recurrent unit for aspect-based sentiment analysis. IEEE Access 9:15561–15569. https://doi.org/10.1109/ACCESS.2021.3052937

Zhou J, Ye J (2020) Sentiment analysis in education research: a review of journal publications. Interact Learn Environ. https://doi.org/10.1080/10494820.2020.1826985

Zucco C, Calabrese B, Agapito G, Guzzi PH, Cannataro M (2020) Sentiment analysis for mining texts and social networks data: methods and tools. Wiley Interdiscip Rev 10(1):e1333. https://doi.org/10.1002/widm.1333

Zunic A, Corcoran P, Spasic I (2020) Sentiment analysis in health and well-being: systematic review. JMIR Med Inform 8(1):e16023. https://doi.org/10.2196/16023

Zuo E, Zhao H, Chen B, Chen Q (2020) Context-specific heterogeneous graph convolutional network for implicit sentiment analysis. IEEE Access 8:37967–37975. https://doi.org/10.1109/ACCESS.2020.2975244

Download references

Acknowledgements

The authors would like to thank the China Scholarship Council (CSC No. 202106850069) for its support for the visiting study.

This work has not received any funding.

Author information

Authors and affiliations.

Institute of High Performance Computing, A*STAR, 1 Fusionopolis Way, Singapore, 138632, Singapore

Jingfeng Cui & Seng-Beng Ho

School of Information Management, Nanjing Agricultural University, 1 Weigang, Nanjing, 210095, China

Jingfeng Cui

School of Computing and Information Systems, Singapore Management University, 80 Stamford Rd, Singapore, 178902, Singapore

Zhaoxia Wang

School of Computer Science and Engineering, Nanyang Technological University, 50 Nanyang Avenue, Singapore, 639798, Singapore

Erik Cambria

You can also search for this author in PubMed   Google Scholar

Corresponding author

Correspondence to Zhaoxia Wang .

Ethics declarations

Conflict of interest.

The authors declare that they have no conflict of interest or competing interest in this article.

Research involving human participants or animals

This article does not contain any studies with human participants or animals performed by any of the authors.

Additional information

Publisher's note.

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cui, J., Wang, Z., Ho, SB. et al. Survey on sentiment analysis: evolution of research methods and topics. Artif Intell Rev 56 , 8469–8510 (2023). https://doi.org/10.1007/s10462-022-10386-z

Download citation

Accepted : 29 December 2022

Published : 06 January 2023

Issue Date : August 2023

DOI : https://doi.org/10.1007/s10462-022-10386-z

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

  • Sentiment analysis
  • Keyword co-occurrence analysis
  • Evolution analysis
  • Research methods
  • Research topics
  • Find a journal
  • Publish with us
  • Track your research

COMMENTS

  1. A systematic review of social media-based sentiment analysis: Emerging

    2.1. The identification of research questions. Sentiment analysis techniques have been shown to enable individuals, organizations and governments to benefit from the wealth of meaningful information contained in the unstructured data of social media, and there has been a great deal of research devoted to the design of high-performance sentiment classifiers and their applications [1], [4], [5 ...

  2. (PDF) Sentiment Analysis for Social Media

    Abstract - Sentiment analysis, the automated extraction of. expressions of positive o r negative attitudes from text ha s received. considerable attention from researchers during the past decade ...

  3. A review on sentiment analysis and emotion detection from text

    Social networking platforms have become an essential means for communicating feelings to the entire world due to rapid expansion in the Internet era. Several people use textual content, pictures, audio, and video to express their feelings or viewpoints. Text communication via Web-based networking media, on the other hand, is somewhat overwhelming. Every second, a massive amount of unstructured ...

  4. A survey on sentiment analysis methods, applications, and challenges

    The rapid growth of Internet-based applications, such as social media platforms and blogs, has resulted in comments and reviews concerning day-to-day activities. Sentiment analysis is the process of gathering and analyzing people's opinions, thoughts, and impressions regarding various topics, products, subjects, and services. People's opinions can be beneficial to corporations, governments ...

  5. [2311.11250] A Comprehensive Review on Sentiment Analysis: Tasks

    Sentiment analysis (SA) is an emerging field in text mining. It is the process of computationally identifying and categorizing opinions expressed in a piece of text over different social media platforms. Social media plays an essential role in knowing the customer mindset towards a product, services, and the latest market trends. Most organizations depend on the customer's response and ...

  6. Sentiment Analysis of Social Media: Techniques, Applications, and

    Sentiment Analysis of Social Media: Techniques, Applications, and Reliability ... A. Goonetilleke and M. Kamruzzaman, Determining disaster severity through social media analysis: Testing the methodology with South East Queensland Flood tweets. ... Sentiment analysis on linkedin comments. International Journal of Engineering Research ...

  7. Sentiment analysis using deep learning techniques: a ...

    With the exponential growth of social media platforms and online communication, the necessity of using automated sentiment analysis techniques has significantly increased. Deep learning techniques have emerged in extracting complex patterns and features from unstructured text data, which makes them a powerful tool for sentiment analysis. This research article presents a comprehensive review of ...

  8. (Pdf) Exploring Sentiment Analysis Research: a Social Media Data

    The study explores sentiment analysis research f or its application in. transforming social media data and i dentifies relevant research aspects through a comprehensive. bibliometric review of 523 ...

  9. (PDF) Sentiment Analysis on Social Media

    Aside from research conducted on Twitter, other microblogging and social media sites, such as Facebook and Reddit, have also been scraped for various research projects that use sentiment analysis ...

  10. Advancing Sentiment Understanding in social media through Dynamic

    The main goal is to provide an in-depth overview of how these techniques have subsidized to the enhancement of sentiment analysis in the ever-evolving landscape of social media platforms. The research involves a comprehensive investigation into the fine-tuning and pre-training of these models on social media datasets. This paper provides an ...

  11. Sentiment analysis on social media tweets using dimensionality

    Sentiment analysis is described in References 7, 18-20 as a technique that uses NLP, data mining and statistical methods to extract the users' opinions and sentiments from social media data, for instance, tweets and other online resources like websites. It can be used to establish and understand how an audience reacts to a brand or issue either ...

  12. Free Full-Text

    Twitter has become a major social media platform and has attracted considerable interest among researchers in sentiment analysis. Research into Twitter Sentiment Analysis (TSA) is an active subfield of text mining. TSA refers to the use of computers to process the subjective nature of Twitter data, including its opinions and sentiments. In this research, a thorough review of the most recent ...

  13. Exploring Sentiment Analysis Techniques in Natural Language Processing

    Sentiment analysis has been applied in opinion mining, brand monitoring, market research, customer service, political analysis, and many more fields. The rise of social media platforms has made sentiment analysis even more essential, as individuals share their ideas and experiences on a massive scale.

  14. Systematic reviews in sentiment analysis: a tertiary study

    Mite-Baidal et al. research sentiment analysis in the education domain. E-learning is upcoming, and due to the online nature, lots of review data is generated on forums of MOOCS and social media. 13. Salah et al. research the social media sentiment analysis. Mainly twitter data is used because of the high dimensionality (e. g., retweets ...

  15. Sentiment Analysis on Social Media

    The Web is a huge virtual space where to express and share individual opinions, influencing any aspect of life, with implications for marketing and communication alike. Social Media are influencing consumersâ preferences by shaping their attitudes and behaviors. Monitoring the Social Media activities is a good way to measure customersâ loyalty, keeping a track on their sentiment towards ...

  16. An Overview of Social Media and Sentiment Analysis

    A wide range of frameworks are used in generation of enormous measure of information in various data formats like facts, content, sound, video. The social media accounts are major platform where people share their perspectives or u can say sentiments in above mentioned formats. In order to investigate the field of sentiment analysis social media platforms are major place that must be ...

  17. Social media-based demographic and sentiment analysis for disaster

    This study explores disaster responses across the United States for Winter Storm Jaxon in 2018 by utilizing demographic and sentiment analysis for Twitter®. This study finds that people show highly fluctuated responses across the study periods and highest natural sentiment, followed by positive sentiment and negative sentiment. Also, some sociodemographic and Twitter variables, such as gender ...

  18. (PDF) Twitter sentiment analysis

    In this paper, the focus is on sentiment analysis of web 3.0 enabled twitter dataset. ... object on social media. The public's sentiment is split into different categories namely, positive ...

  19. Sustainability

    The paper delineates the semi-structured interview-based methodology, shows results through word-cloud and sentiment analysis, and engages in discussions on consumer behaviour through four distinct clusters, concluding with limitations, managerial implications and outlining future research directions.

  20. Americans' Changing Relationship With Local News

    These are some of the key findings from a new Pew Research Center survey of about 5,000 U.S. adults conducted in January 2024. This is the first in a series of Pew Research Center reports on local news from the Pew-Knight Initiative, a research program funded jointly by The Pew Charitable Trusts and the John S. and James L. Knight Foundation.

  21. Sentiment analysis using Twitter data: a comparative application of

    In this paper, we implement social media data analysis to explore sentiments toward Covid-19 in England. This paper aims to examine the sentiments of tweets using various methods including lexicon and machine learning approaches during the third lockdown period in England as a case study.

  22. 'Biden-mobile' or the future of transportation? How EVs got polarized

    An analysis of social media content by The Post shows that conservative criticism of electric cars rose in August 2022 after California announced rules to phase out fully gas-powered cars by 2035 ...

  23. How bad bets on meme stocks led to a $1 billion wipeout

    But with this week's surge, those gains have vanished and left shorts with more than $1.2 billion in paper losses, according to research from S3 Partners. Nearly $1 billion of that was wiped out ...

  24. A survey of sentiment analysis in social media

    Sentiments or opinions from social media provide the most up-to-date and inclusive information, due to the proliferation of social media and the low barrier for posting the message. Despite the growing importance of sentiment analysis, this area lacks a concise and systematic arrangement of prior efforts. It is essential to: (1) analyze its progress over the years, (2) provide an overview of ...

  25. Trump Media Stock Rally Pushes Value of Trump's Stake to $6 Billion

    A recent analysis of the ownership of Trump Media shares by S&P Global Market Intelligence found that Mr. Trump, corporate insiders and an entity closely affiliated with Trump Media owned nearly ...

  26. Survey on sentiment analysis: evolution of research methods ...

    Sentiment analysis, one of the research hotspots in the natural language processing field, has attracted the attention of researchers, and research papers on the field are increasingly published. Many literature reviews on sentiment analysis involving techniques, methods, and applications have been produced using different survey methodologies and tools, but there has not been a survey ...