This tool has been retired as of July 2025. See the new unified tool with similar functionalities at OSoMeNet.
Description
NOTE: Hoaxy was first developed in 2016. This toolkit description tested the most recent version and its features to date.
Hoaxy assists in unraveling the complex web of information dissemination online, particularly on X (Twitter) and Bluesky. It primarily addresses the problem of mis- and disinformation spreading rapidly on social media platforms. Users can search for a particular topic or keyword on either platform, and it will automatically generate a graph showing the spread of information.
For open-source researchers wanting to verify false claims, it is important to recognize that the anatomy of information dissemination has multiple moving parts: When did a narrative start? Where did it first appear? Who is the source of the narrative? How credible is the source? And who are the receivers/audience? Among the receivers, which accounts forwarded the information? How fast did a particular narrative spread within a particular time-frame?
Hoaxy solves some of these questions through the following key features:
Search: This allows users to find specific keywords, URLs, or users on Bluesky and X/Twitter that may need verification or monitoring.
Visualization: This offers users a visual representation of data that can illustrate who is sharing the information, the connections between different users, and the chronological order in which the information is spread.
How does it work?
Step 1: Choose either Bluesky or X/Twitter
Step 2: Enter your search terms
Step 3: Hoaxy automatically generates a visualization
Step 4: Analyze the visualization map and its data
The landing page of the Hoaxy tool automatically lets you choose your search preferences. You can choose to do a query on Bluesky or on X/Twitter or to import your own data. At the bottom you find a search bar for keywords, account, URL, phrases.
DOING A BLUESKY SEARCH
If the user is interested in tracking information spread on the Bluesky platform, users must select “Search Bluesky”, and then they can enter the following input on the search bar:
Single term (i.e. airstrike)
Phrases (enclosed in quotation marks, i.e. “Kyiv airstrike” )
Combine multiple terms using Boolean operators AND, OR, NOT (i.e. "Kyiv airstrike" AND (report OR news) NOT "false alarm")
DOING A X/TWITTER SEARCH
NOTES:
This function requires a Bearer Token that is only available if a user purchased a Basic, Pro or Premium+ subscription on X.
If the user is interested in tracking information spread on the X platform, users must select “Search X/Twitter”. Then, they can enter search terms in the same way they can use the advanced search feature of X (See X’s guide to building a query here or this guide). This tool is compatible with X’s advanced search operators.
For example: "Amazon deforestation" AND (illegal OR logging) has:videos
IMPORT YOUR OWN DATA
In addition to examining content from Bluesky and X, users can also import their own data into the tool’s visualization tool, as long as the data is in the required format: CSV and JSON.
VISUALIZATION
Once users have entered the search terms, Hoaxy will generate a visualization containing two things: “Diffusion networks” and “temporal trends”.
This screenshot shows the visualization automatically generated by Hoaxy for a Bluesky query for the keyword "airstrikes." The right panel shows "diffusion networks" (as described below), and on the left panel is the "temporal trends" (also described below). In between these panels are a few features that users can modify and play around with to change how the graph looks for analysis. The account shown in a yellow box (to the right) shows a cluster that is not part of the larger clusters. This represents a separate network or conversation where the keywords appear, but no user accounts overlap.
TIP: Hovering your mouse over the graph to either the left or right will show the dates and the number of posts generated on that particular date.
(Click to enlarge) This image shows the timeline of how posts about a specific topic spread over time. In this example, the tool automatically generates a timeframe from 13 March 2024 to 17 April 2024 (shown in the orange square). On the top of the image are options to "add old posts" or "add new posts" (shown in green circles).
Middle Panel
The middle panel shows different features that users can change depending on their preferences. It consists of layout options, export options, a 2D or 3D display, and a short clip showing how information spreads.
The screenshot shows the middle menu for the visualization and the different ways users can display their results for analysis.
The tool also allows you to play a short animated clip showing how the nodes and connections are made, and how it unfolds over time, from the beginning date to the end date as displayed on the left panel.
Clicking on the "Play" icon in the middle panel shows users how the information spreads from the beginning date, 29 April 2023 to the end date, 3 Aug 2024. This shows a Bluesky query looking for mentions of Bellingcat.
CHECKING YOUR RESULTS
It is recommended to review your results to understand the type of data and content presented and determine if it aligns with your research topic. To explore, users can choose a cluster and click a node or user account. This action displays a pop up window displaying accounts it interacts with- through mentions, replies, or quotes— and includes a link to that specific post. Additionally, clicking on the connecting lines (edges) will bring up a pop up showing the related post.
Click on a node, and it will show you actions made by that account and who it interacted with. In this example, we did a Bluesky search for @bellingcat.com. On the right side, we selected the account @scrippsnews.bsky.social, and to the left, a pop-up window appears to show the activity. Underneath (shown in green arrows) are links to the post or content being forwarded.
Click on an edge (the color-coded connecting lines) to automatically generate a pop-up window showing the content being shared from one account to another (or from one node to another).
Use case
The main use case for Hoaxy is for verification and to get a sense of the context in which a piece of information is shared:
Open-source researchers can analyze the network of users who shared a piece of information and identify whether the original sources are credible news outlets or dubious accounts known for spreading mis- or disinformation. This can be one of the first steps when trying to get a sense of the spread of a particular claim on social networks. It also provides a clearer understanding of the networks in which a piece of information is being shared as well as the timeframe during which the claim was most active. See here for an example of how it was used to debunk “Picklegate” during a presidential campaign season.
Cost
Bluesky search and importing your own data is free, but an X/Twitter search requires a Bearer Token that can only be available if the user pays for an X Premium subscription.
Level of difficulty
Requirements
X user account
X user account Bearer Token (available if user purchased a Basic, Pro or Premium+ subscription)
Limitations
Beta Status: Based on its landing page, Hoaxy still appears to be in the beta phase.
This may affect its accessibility and ease of use, especially for users who are less technically inclined. While it sometimes takes a bit longer to load the visualization graphs, it generally performs well.
Date/Time Frame Limitations: based on some tests, Hoaxy automatically chooses a date/time frame for your search. Users have the option to fetch older or newer posts, but the tool sometimes struggles to load the visualization after doing so. Users may try using date and time search operators to see if Hoaxy can fetch posts within a timeframe of interest.
Public Posts Only: Hoaxy can only retrieve and analyze public posts, leaving private conversations unexamined.
Cross-Platform Limitations: Hoaxy requires additional tools to track mis- and disinformation that spills over to other platforms, as it can rapidly spread from one platform to another. The dynamics of information spread vary significantly across different social media. As shown in this BBC Investigation, the spread of misinformation in the Southport protests started on one platform and then migrated to a different platform, where it was even more amplified.
Needs Some Familiarity with Network Analysis for Interpretation: To fully understand and make sense of the data visualizations provided by Hoaxy, it's helpful to have some background knowledge in network analysis.
Content Analysis: Hoaxy is not a tool for distinguishing between different types of mis- and disinformation. Users may need additional tools or methods to conduct detailed content analysis and identify various forms of mis- and disinformation accurately. According to UNESCO World Media Trends, “A claim may be a false news article, hoax, rumor, conspiracy theory, satire, or even an accurate report. (Hoaxy) is not meant to be used as a fact-checking tool but rather to understand how disinformation and misinformation is being spread by non-trustworthy accounts.”
Overall, while Hoaxy is created as a comprehensive tool, its reliance on specific data sources, limited platform scope, and occasional usability issues highlight some gaps in it's capabilties. Hoaxy is best used in addition to other open-source investigative tools.
Ethical Considerations
Contextual Clarity: When sharing the visualization with others, provide sufficient context to prevent misinterpretation.
User Anonymity: While Hoaxy might not reveal individual user identities, the tool can still expose patterns of behavior that could potentially lead to identification.
Tool provider
Hoaxy is a project of Indiana University's Observatory on Social Media (OSoMe).