Skip to main contentSkip to navigation
MachineHack Gen AI Logo
MLDS 2025 | Sequence Classification  Logo

MLDS2025|
SequenceClassification

Expired
Start: December 23, 2024Ends: January 26, 2025
Participants
229
Time Left
Ended
Subs/day
3
Challenge Overview

Welcome to the MLDS 2025 Hackathon!

Problem Statement:

We’re excited to launch a unique challenge in the lead-up to MLDS 2025, where your skills in fine-tuning Small language models (SLMs) will be tested. This hackathon focuses on multi-class classification—your task is to fine-tune an SLM to classify data into multiple categories using the provided dataset accurately

Participation and Benefits

Skill Level:
Ideal for participants experienced in LLM fine-tuning, classification tasks, and exploring deep learning-based NLP solutions.

Community Engagement:
Be part of the MLDS community—engage with peers in our Telegram group, ask questions, and share insights during the competition.

Recognition:

  • All participants will receive a MachineHack certificate of participation.
  • The top 3 performers will not only earn bragging rights but also exclusive MLDS 2025 tickets giving them access to one of the largest gatherings of machine learning and data science professionals.

Submission and Evaluation

Submission Format:
Please submit the fine-tuned SLM model after testing its support and execution on the provided test script (link), and its dependencies before uploading to the portal. The LLM files will be accepted in .safetensors & .json formats.

Evaluation Metric:
Submissions will be evaluated based on classification accuracy, rewarding precise and consistent predictions.

Leaderboard:
Track your ranking live and aim for the top spot on the leaderboard!

How to Approach the Challenge

Note: Please train your model to predict the "label_model" column given in the train file from the inference approach as per this script (link).

Data Preprocessing

  • Text Cleaning: Remove unnecessary characters, noise, and symbols for cleaner input to your LLM.
  • Tokenization: Use LLM-specific tokenizers like Hugging Face’s AutoTokenizer for efficient encoding.

Feature Engineering

  • Label Encoding: Ensure proper encoding of class labels for seamless integration with model outputs.
  • Handling Imbalanced Data: Consider techniques like oversampling or weighted loss functions to address class imbalances.

Modeling Techniques

  • Fine-Tuning LLMs: Use models such as BERT, RoBERTa, or GPT for multi-class classification, fine-tuned on your dataset.
  • Transfer Learning: Leverage pre-trained weights to kickstart training and improve generalization.

Validation and Tuning

  • Cross-Validation: Implement robust k-fold validation for consistent performance.
  • Hyperparameter Tuning: Experiment with parameters like learning rate, batch size, and epochs to optimize results.

Getting Started

Download the Dataset:
As the competition starts, the training dataset will be ready for you to dive into.

Join the Community:
Collaborate, brainstorm, and troubleshoot with fellow participants in our Telegram group.

Support and Queries

For assistance, feel free to reach out to our team at support@machinehack.com.
Wishing you the best!

Problem Statement

This challenge focuses on building advanced machine learning models to solve real-world problems. Participants will work with carefully curated datasets and compete to achieve the best performance metrics.

Target Column: Genre
Metric: accuracy_score
Level: Intermediate
Submissions: 3/day
Top Submissions

No leaderboard data available

Check back later for updates

MLDS 2025 | Sequence Classification

Registration is open

Similar Challenges

Discover similar AI and data science competitions

No sponsored hackathons available at the moment.

Never Miss a Hackathon

Get notified about new AI hackathons, data science competitions, and exclusive opportunities. Join 50,000+ developers staying ahead of the curve.

No spam, unsubscribe at any time. We respect your privacy.

    MLDS 2025 | Sequence Classification | Hackathon Hackathon | MachineHack