Meta Content Library
Meta Content Library is a controlled-access tool that lets approved academic and non-profit researchers search the full public archive of Facebook, Instagram, and Threads posts, in near-real-time.
URL
https://transparency.meta.com/researchtools/meta-content-library/\
(as of the most recent update from May 5th, 2025)

Description
The Meta Content Library is a research tool that provides vetted academic and non-profit researchers with access to public posts from Facebook, Instagram, and Threads. It enables near real-time and historical analysis of social media content, facilitating studies on political discourse, public health trends, misinformation, and other topics. (Replaces deprecated CrowdTangle for research at scale; the only Meta-sanctioned source that exposes view-count (“exposure”) data.)
The platform features:
User Interface (UI): A web-based dashboard for content search and filtering.
API Access: Programmatic access for large-scale queries (Python or R via ICPSR's secure Virtual Data Enclave).
Since Feb 2025, the Library includes all public Threads posts that meet the 1 k-follower rule; engagement metrics and OCR text-in-image search work for Threads too.
Advanced Filtering: Keyword searches, engagement metrics, date ranges, and language filters.
Engagement Insights: View counts, reactions, shares, and comments with hashed user IDs.
Historical Data: Access dating back to Facebook's launch (2004), with updates in near real-time.
CSV Downloads: Available for accounts meeting the "widely known" criteria (e.g., Facebook Pages with ≥15,000 followers, Instagram accounts with ≥25,000 followers, Threads public profiles ≥ 1 000 followers). CSV export is only for Facebook & Instagram posts that meet the “widely-known” criteria; Threads is UI-only for now.
Privacy Protection: Content is subject to deletion/privacy settings, ensuring ethical use.
Cost
Level of difficulty
Moderate difficulty: The UI is user-friendly, but API access requires coding knowledge and data analysis skills.
Requirements
Researchers must be affiliated with an academic or non-profit institution.
Applications are vetted by the Inter-university Consortium for Political and Social Research (ICPSR).
No IRB approval required, but applicants must submit a research agenda.
User interface access is available upon approval; API access requires a separate virtual data enclave (VDE) credential, which requires additional documentation (research proposal approved by your IRB). No raw-file download from the API; analysis must stay inside the VDE.
Limitations
Restricted Access: Only available to approved researchers (not the general public).
Data Privacy: Deleted/private posts become unavailable; raw data cannot be exported from the API.
Query Limits: API access has a weekly cap of 500,000 retrieved content items.
Replication Challenges: Due to content deletion or privacy changes, perfect replication of results can be difficult.
No Personal Identifiers: User data is hashed unless the account is a public entity.
Data-retention rule: Researchers must delete data older than 180 days that is no longer in the library.
Ethical Considerations
Privacy Compliance: Researchers must respect data privacy and ethical standards.
No User Tracking: The platform does not allow longitudinal tracking of individuals.
Use Limitations: Data cannot be used for surveillance or commercial purposes.
Guide
Official Documentation: Meta Transparency Center
ICPSR Social Media Archive: https://www.icpsr.umich.edu/
"Public Data Access Programs: A First Look" is a comprehensive evaluation by Hickey, Dowling, Navia, and Pershan (2024, Mozilla Foundation). The study assesses how major platforms, including Meta, provide researchers access to public data under the Digital Services Act, with a focus on usability, transparency, and technical limitations. It highlights the platform's dual-access approach via a user-friendly dashboard and API while also noting challenges such as data consistency and limited documentation that can affect replicability and broader research applications.
In this two-part webinar series, led by Professor Anja Neundorf under the DEMED project, researchers at the University of Glasgow and specialists from Meta discuss the Meta Content Library’s evolving capabilities. The first session (2023) focuses on core concepts such as keyword-based searches and privacy safeguards for analyzing public Facebook and Instagram data. The second session (2025) highlights newly added features, including comment-level data, text-in-image matching, and Threads integration, to support more advanced social media research.
Tool provider
Meta Platforms, Inc. Menlo Park, California, U.S
Advertising Trackers
Martin Sona
Last updated
Was this helpful?