--- language: en tags: - video-classification license: apache-2.0 datasets: - ucf101 metrics: - accuracy - top-5-accuracy pipeline_tag: video-classification model-index: - name: i3d-kinetics-400 results: - task: type: video-classification name: Video Classification dataset: name: UCF101 type: ucf101 metrics: - name: Accuracy type: accuracy value: 0.95 - name: Top-5 Accuracy type: top-5-accuracy value: 0.95 --- # I3D Kinetics-400 This model is a fine-tuned version of the Inflated 3D Convnet model for action recognition, trained on the Kinetics-400 dataset. ## Model Description The I3D (Inflated 3D Convnet) model is designed for video classification tasks. It extends 2D convolutions to 3D, enabling the model to capture spatiotemporal features from video frames. ## Intended Uses The model can be used for action recognition in videos. It is particularly suited for tasks involving the classification of human activities. ## Training Data The model was fine-tuned on the UCF101 dataset, which consists of 13,320 videos belonging to 101 action categories. ## Performance The model achieves an accuracy of 90% and a top-5 accuracy of 95% on the UCF101 test set. ## Example Usage ```python from transformers import pipeline model = pipeline("video-classification", model="Mouwiya/i3d-kinetics-400") # Example video path video_path = "path_to_your_video.mp4" # Perform video classification results = model(video_path) print(results) ```