ISSR_Dark_Web_26Topics_Raid

This is a BERTopic model. BERTopic is a flexible and modular topic modeling framework that allows for the generation of easily interpretable topics from large datasets.

Usage

To use this model, please install BERTopic:

pip install -U bertopic

You can use the model as follows:

from bertopic import BERTopic

# Download the model from the Hugging Face Hub and load it
topic_model = BERTopic.load("D0men1c0/ISSR_Dark_Web_26Topics_Raid")

# Summary table with one row per topic
topic_model.get_topic_info()

You can make predictions as follows:

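# transform() takes a list of documents and returns (topics, probabilities)
docs = ['porn']
topics, _ = topic_model.transform(docs)
# Show the summary row for the topic assigned to the first document
topic_model.get_topic_info(topics[0])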
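
Because `transform` takes a list, several documents can be classified in one call. The following is a minimal sketch rather than part of the original card: `predict_topics` and `load_and_predict` are hypothetical helpers, and the sketch assumes that `get_topic` returns a list of `(word, score)` pairs for a known topic ID.

```python
def predict_topics(topic_model, docs, n_words=5):
    """Return (document, topic_id, top_keywords) for each document.

    `topic_model` is expected to be a loaded BERTopic model.
    """
    # transform() returns (topic ids, probabilities); only the ids are used here
    topics, _ = topic_model.transform(docs)
    results = []
    for doc, topic_id in zip(docs, topics):
        topic_id = int(topic_id)
        # get_topic() yields (word, score) pairs; fall back to [] for an unknown id
        words = topic_model.get_topic(topic_id) or []
        keywords = [word for word, _ in words[:n_words]]
        results.append((doc, topic_id, keywords))
    return results


def load_and_predict(docs):
    """Load the published model from the Hub and classify `docs`."""
    # Imported lazily so predict_topics() stays usable without bertopic installed
    from bertopic import BERTopic

    model = BERTopic.load("D0men1c0/ISSR_Dark_Web_26Topics_Raid")
    return predict_topics(model, docs)
```

Calling `load_and_predict(["first post title", "second post title"])` (made-up inputs) downloads the model and returns one tuple per document. A topic ID of -1 means the document was treated as an outlier.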

Topic overview

  • Number of topics: 27 (26 topics plus the outlier topic -1)
  • Number of training documents: 94407
| Topic ID | Topic Keywords | Topic Frequency | Label |
|----------|----------------|-----------------|-------|
| -1 | free - how - accounts - logs - the | 594 | -1_free_how_accounts_logs |
| 0 | database - data - db - databases - dump | 36319 | Data Collection |
| 1 | hacking - security - course - windows - hacker | 7773 | Windows Hacking Course |
| 2 | mega - onlyfans - collection - gb - nsfw | 5313 | NSFW Mega OnlyFans Pack |
| 3 | email - pass - mail - email pass - access | 4326 | Email Pass Access Combo |
| 4 | the - kek - de - dodgy - vs | 4292 | MMA Fights |
| 5 | combo - combolist - hq - private - gaming | 4707 | Gaming Combo List |
| 6 | accounts - account - premium - capture - with | 2961 | Origin Account Capture |
| 7 | htb - flag - writeup - hackthebox - root | 3649 | Hack The Box Root Flags Writeup |
| 8 | leak - leaked - leaked download - database leaked - download | 2275 | Data Leak |
| 9 | hello - hi - new - hey - im | 2027 | Hello New Member |
| 10 | raid - raidforums - server - raiding - raid raid | 1732 | Server Raids |
| 11 | anime - manga - animes - best - naruto | 2056 | Anime Favorites |
| 12 | crypto - money - bitcoin - make - earn | 1120 | Earn Bitcoin Free Crypto Money Methods |
| 13 | vpn - nordvpn - nord - accounts - nord vpn | 1811 | NordVPN Premium Accounts |
| 14 | netflix - spotify - steam - minecraft - combo | 1017 | words or less |
| 15 | twitch - youtube - streamer - stream - raid | 1136 | Twitch Raid Streams |
| 16 | music - song - songs - favorite - cd | 1260 | Music Favorite |
| 17 | spotify - spotify premium - premium - accounts - spotify accounts | 1002 | Spotify Premium Accounts |
| 18 | dox - doxing - dox dox - doxed - doxxing | 864 | Doxing Tutorial |
| 19 | discord - discord server - server - tokens - discord tokens | 810 | Discord Server Tokens |
| 20 | scam - scam report - report - banned - banned scam | 881 | Scam |
| 21 | proxy - proxies - socks - proxy list - http | 674 | Proxy Lists |
| 22 | thread - help - what - new - time | 636 | Thread |
| 23 | netflix - hulu - accounts - premium - netflix premium | 3631 | Premium Netflix Accounts |
| 24 | instagram - twitter - followers - instagram account - account | 773 | Social Media Accounts for Sale |
| 25 | discord - raid - discord raid - raid discord - discord server | 768 | Discord Raids |
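
As a quick consistency check on the overview, the frequencies can be summed and compared with the reported number of training documents. The sketch below copies the numbers from the topic overview by hand; the variable names are illustrative only. The 27 rows (topics 0 to 25 plus the outlier topic -1) add up to 94407, which matches the training document count.

```python
# Topic frequencies copied from the topic overview, keyed by topic ID
TOPIC_FREQUENCIES = {
    -1: 594, 0: 36319, 1: 7773, 2: 5313, 3: 4326, 4: 4292, 5: 4707,
    6: 2961, 7: 3649, 8: 2275, 9: 2027, 10: 1732, 11: 2056, 12: 1120,
    13: 1811, 14: 1017, 15: 1136, 16: 1260, 17: 1002, 18: 864, 19: 810,
    20: 881, 21: 674, 22: 636, 23: 3631, 24: 773, 25: 768,
}

total_documents = sum(TOPIC_FREQUENCIES.values())
outliers = TOPIC_FREQUENCIES[-1]
# Largest real topic, ignoring the outlier bucket (-1)
largest_topic = max(
    (topic_id for topic_id in TOPIC_FREQUENCIES if topic_id != -1),
    key=TOPIC_FREQUENCIES.get,
)

print("rows:", len(TOPIC_FREQUENCIES))
print("total documents:", total_documents)
print(f"outlier share: {outliers / total_documents:.2%}")
print(f"largest topic: {largest_topic} "
      f"({TOPIC_FREQUENCIES[largest_topic] / total_documents:.2%} of documents)")
```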

Training hyperparameters

  • calculate_probabilities: True
  • language: None
  • low_memory: False
  • min_topic_size: 10
  • n_gram_range: (1, 4)
  • nr_topics: None
  • seed_topic_list: None
  • top_n_words: 10
  • verbose: True
  • zeroshot_min_similarity: 0.7
  • zeroshot_topic_list: None
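
For readers who want to start from the same settings, the values above can be passed to the `BERTopic` constructor as keyword arguments. This is a sketch under stated assumptions, not the author's training script: the card does not list the embedding model or the UMAP, HDBSCAN, and vectorizer settings, so the result will not reproduce this model. `build_untrained_model` is a hypothetical helper, and it assumes a BERTopic release that accepts the `zeroshot_*` arguments.

```python
# Values copied from the "Training hyperparameters" list
HYPERPARAMETERS = {
    "calculate_probabilities": True,
    "language": None,
    "low_memory": False,
    "min_topic_size": 10,
    "n_gram_range": (1, 4),
    "nr_topics": None,
    "seed_topic_list": None,
    "top_n_words": 10,
    "verbose": True,
    "zeroshot_min_similarity": 0.7,
    "zeroshot_topic_list": None,
}


def build_untrained_model(embedding_model=None):
    """Create an unfitted BERTopic instance with the listed hyperparameters.

    `embedding_model` is not documented in the card; pass your own choice here.
    """
    # Imported lazily so the settings dict can be inspected without bertopic installed
    from bertopic import BERTopic

    return BERTopic(embedding_model=embedding_model, **HYPERPARAMETERS)
```

The returned model still has to be fitted with `fit_transform` on your own documents before it can assign topics.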

Framework versions

  • Numpy: 1.26.4
  • HDBSCAN: 0.8.36
  • UMAP: 0.5.6
  • Pandas: 2.2.1
  • Scikit-Learn: 1.4.1.post1
  • Sentence-transformers: 3.0.1
  • Transformers: 4.39.3
  • Numba: 0.60.0
  • Plotly: 5.22.0
  • Python: 3.12.2
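
Loading a saved model can be sensitive to library versions, so it may help to compare the local environment with the list above. The sketch below uses only the standard library. The PyPI distribution names are assumptions (for example, UMAP is published as `umap-learn`), and `installed_version` and `version_report` are hypothetical helpers.

```python
import sys
from importlib import metadata

# Versions copied from the "Framework versions" list, keyed by assumed PyPI name
EXPECTED_VERSIONS = {
    "numpy": "1.26.4",
    "hdbscan": "0.8.36",
    "umap-learn": "0.5.6",
    "pandas": "2.2.1",
    "scikit-learn": "1.4.1.post1",
    "sentence-transformers": "3.0.1",
    "transformers": "4.39.3",
    "numba": "0.60.0",
    "plotly": "5.22.0",
}
EXPECTED_PYTHON = "3.12.2"


def installed_version(dist_name):
    """Return the installed version string, or None if the package is missing."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


def version_report(expected=EXPECTED_VERSIONS):
    """Map each package name to (expected, installed, matches)."""
    report = {}
    for name, wanted in expected.items():
        found = installed_version(name)
        report[name] = (wanted, found, found == wanted)
    return report


if __name__ == "__main__":
    current_python = ".".join(str(part) for part in sys.version_info[:3])
    print(f"python: expected {EXPECTED_PYTHON}, found {current_python}")
    for name, (wanted, found, matches) in version_report().items():
        status = "ok" if matches else "differs"
        print(f"{name}: expected {wanted}, found {found} ({status})")
```

A mismatch does not necessarily mean the model will fail to load; the report only shows where the environment differs from the one used for training.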