Skip to main contentSkip to navigation
MachineHack Gen AI Logo
News Story Category Prediction Logo

NewsStoryCategoryPrediction

Expired
Start: July 18, 2024Ends: August 11, 2024
Participants
198
Time Left
Ended
Subs/day
9
Challenge Overview

Welcome to Week 4 of the Weekly MachineHack Hackathon series! This week presents an exciting new data science challenge: creating a classification model to predict the relevant category label for provided news snippets within the test data. This multi-class text classification problem opens up numerous possibilities for innovative and effective solutions. We eagerly anticipate your creative approaches and outcomes.

Event Duration

  • Start Date: 18 July 2024
  • End Date: 9 August 2024

Challenge Details

Your task is to develop a classification model capable of accurately predicting the category of news stories. The categories range across various fields, including Style & Beauty, Parenting, Arts, Wellness, Religion, Entertainment, Women, Politics, and Travel. This problem requires participants to apply their natural language processing and machine learning knowledge to distinguish between these categories based on the provided text snippets.

Participation and Benefits

  • Intermediate Level: The hackathon is geared towards participants with a basic understanding of machine learning and text classification.
  • Community Engagement: Join our vibrant community on Telegram to discuss ideas, ask questions, and collaborate with fellow participants.
  • Certificates: All participants will receive a certificate from MachineHack, and winners will be prominently featured on the leaderboard.
  • Live Walkthrough Session: A live session will be held on 24th July at 7 PM to guide participants through the challenge and offer valuable insights.

Submission and Evaluation

  • Submission Format: Participants must submit their predictions in the specified format in the submission.csv file.
  • Evaluation Metric: The models will be evaluated based on their accuracy in predicting the correct categories.
  • Leaderboard: Stay updated on your progress and aim for the top spot on the leaderboard.

Data Description

The dataset for this hackathon includes the following:

  • training.csv: Contains the training data with news headlines, descriptions, and their respective categories.
  • test.csv: Test data that participants will use to generate predictions.
  • submission.csv: The format for submitting your predictions.

How to Crack This Challenge

Successfully cracking this challenge involves leveraging natural language processing techniques and machine learning algorithms. Here are some steps to get started:

  1. Data Preprocessing: Clean the text data by removing stop words, and punctuation, and applying techniques like stemming or lemmatization.
  2. Feature Extraction: Convert the text data into numerical features using methods like TF-IDF or word embeddings.
  3. Model Selection: Experiment with various classification models such as Logistic Regression, Support Vector Machines, or advanced models like BERT.
  4. Evaluation and Tuning: Evaluate your models using cross-validation and tune hyperparameters to improve performance.

For our subscribers, a starter notebook is available to help you kickstart your solution. This notebook provides a basic framework for data preprocessing and model building, which you can further enhance and customize.

Getting Started

  • Register Now: Ensure you are registered for the event to participate and receive updates.
  • Download the Dataset: Access the dataset from the MachineHack platform to begin working on your solution.
  • Join the Community: Engage with fellow participants and mentors through our Telegram group for support and collaboration.

Support and Resources

For any questions or assistance, feel free to contact the support team at support@machinehack.com. Stay updated with the latest information and announcements by subscribing to our newsletter.

Happy Hacking and Growing! 🚀

Problem Statement

This challenge focuses on building advanced machine learning models to solve real-world problems. Participants will work with carefully curated datasets and compete to achieve the best performance metrics.

Target Column: category
Metric: accuracy_score
Level: Intermediate
Submissions: 9/day
Top Submissions

No leaderboard data available

Check back later for updates

News Story Category Prediction

Registration is open

Similar Challenges

Discover similar AI and data science competitions

No sponsored hackathons available at the moment.

Never Miss a Hackathon

Get notified about new AI hackathons, data science competitions, and exclusive opportunities. Join 50,000+ developers staying ahead of the curve.

No spam, unsubscribe at any time. We respect your privacy.