huuuyeah/meetingbank
Viewer • Updated • 6.89k • 1.15k • 31
Custom PyTorch Transformer checkpoint trained on MeetingBank for meeting summarization research. This repository is part of the transformer-lab collection.
| Field | Value |
|---|---|
| Repository | Pradheep1647/run_hca-meetingbank-bs8-e20-fp32-19 |
| Attention | hca |
| Dataset | meetingbank |
| Layers | 6 |
| Hidden size | 512 |
| Heads | 8 |
| Batch size | 8 |
| Epochs | 20 |
| Precision | fp32 |
| Checkpoint | run_hca-meetingbank-bs8-e20-fp32-1919.pt |
Static architecture diagram generated from this run's config.json, including model width, depth, sequence dimensions, and attention-specific settings.
Raw curve data is available in loss_curve.csv.
| File | Purpose |
|---|---|
run_hca-meetingbank-bs8-e20-fp32-1919.pt |
PyTorch checkpoint containing model_state_dict, optimizer states, epoch, and global step. |
config.json |
Training and architecture config converted from the Hydra run config. |
architecture.png |
Architecture diagram generated from the saved model config, with block shapes and dimensions. |
tokenizer.json |
MeetingBank transcript tokenizer alias for source inputs. |
transcript_tokenizer.json |
Explicit MeetingBank transcript tokenizer. |
summary_tokenizer.json |
MeetingBank summary tokenizer for target text. |
loss_curve.csv |
TensorBoard train/loss scalar export. |
loss_curve.svg |
Static training-loss plot generated from loss_curve.csv. |
These checkpoints are from a custom PyTorch codebase, not a transformers.AutoModel checkpoint. Use the repo-native builder to instantiate the architecture, then load the checkpoint state dict.
from pathlib import Path
import torch
from huggingface_hub import hf_hub_download
from omegaconf import OmegaConf
import src # registers components
from src.model.builder import build_causal_lm
repo_id = "Pradheep1647/run_hca-meetingbank-bs8-e20-fp32-19"
config_path = hf_hub_download(repo_id=repo_id, filename="config.json")
checkpoint_path = hf_hub_download(repo_id=repo_id, filename="run_hca-meetingbank-bs8-e20-fp32-1919.pt")
cfg = OmegaConf.load(config_path)
model = build_causal_lm(cfg)
state = torch.load(checkpoint_path, map_location="cpu")
model.load_state_dict(state["model_state_dict"])
model.eval()
print(f"Loaded {repo_id} from {Path(checkpoint_path).name}")
config.json as the source of truth for architecture parameters.