Machine Learning Datasets and Resources

Detection Bots and Templates Utilizing ML

Did you create an ML bot? Share it with the community on the ML Discord Channel!

Sources for Training Data

Is a resource the community could use missing from the lists below? Contribute to the docs.

Datasets

  • Forta Datasets on Hugging Face - Training datasets for malicious smart contract detection.
  • Forta Labelled datasets - Web3 threat-related labelled datasets for data analysis and machine learning development.
  • EtherScamDB - Open-source database that tracks Ethereum scams and the addresses involved.
  • Forta API - Query critical alerts via the Forta API and use them as weak labels.
  • web3rekt.com - Query known blockchain incidents and scams.
  • XBlock - Blockchain datasets used in academic research.
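
The Forta API entry above suggests using critical alerts as weak labels. A minimal sketch of that idea follows, assuming the alerts have already been fetched into plain Python dictionaries. The field names (`address`, `severity`), the sample records, and the `min_critical` threshold are hypothetical and chosen for illustration only; they are not the Forta API's actual response schema.

```python
from collections import defaultdict

# Hypothetical alert records. The field names are illustrative assumptions,
# not the real Forta API response format.
alerts = [
    {"address": "0xaaa", "severity": "CRITICAL"},
    {"address": "0xaaa", "severity": "CRITICAL"},
    {"address": "0xbbb", "severity": "LOW"},
    {"address": "0xccc", "severity": "CRITICAL"},
]


def weak_labels(alerts, min_critical=1):
    """Label an address 1 (suspicious) if it has at least `min_critical`
    critical alerts, otherwise 0. The labels are noisy by construction."""
    critical_counts = defaultdict(int)
    seen = set()
    for alert in alerts:
        addr = alert["address"].lower()  # normalize address casing
        seen.add(addr)
        if alert["severity"] == "CRITICAL":
            critical_counts[addr] += 1
    return {addr: int(critical_counts[addr] >= min_critical) for addr in sorted(seen)}


labels = weak_labels(alerts)
print(labels)
```

Raising `min_critical` trades recall for precision: fewer addresses are flagged, but each flagged address is backed by more alerts. Because weak labels are noisy, they are best validated against a labelled dataset, such as those listed above, before being used for training.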

Blockchain Data

Data Science Competitions

Blog posts and Guides

ML Best Practices

Join the Forta ML Discord

Do you have ideas on how machine learning can be used on the Forta Network? Share them on the ML Discord Channel.