News Story Category Prediction
About This Hackathon
<p>Welcome to Week 4 of the Weekly MachineHack Hackathon series! This week presents an exciting new data science challenge: creating a classification model to predict the relevant category label for provided news snippets within the test data. This multi-class text classification problem opens up numerous possibilities for innovative and effective solutions. We eagerly anticipate your creative approaches and outcomes.</p><h3>Event Duration</h3><ul><li><strong>Start Date:</strong> 18 July 2024</li><li><strong>End Date:</strong> 9 August 2024</li></ul><h3>Challenge Details</h3><p>Your task is to develop a classification model capable of accurately predicting the category of news stories. The categories range across various fields, including Style & Beauty, Parenting, Arts, Wellness, Religion, Entertainment, Women, Politics, and Travel. This problem requires participants to apply their natural language processing and machine learning knowledge to distinguish between these categories based on the provided text snippets.</p><h3>Participation and Benefits</h3><ul><li><strong>Intermediate Level:</strong> The hackathon is geared towards participants with a basic understanding of machine learning and text classification.</li><li><strong>Community Engagement:</strong> Join our vibrant community on Telegram to discuss ideas, ask questions, and collaborate with fellow participants.</li><li><strong>Certificates:</strong> All participants will receive a certificate from MachineHack, and winners will be prominently featured on the leaderboard.</li><li><strong>Live Walkthrough Session:</strong> A live session will be held on 24th July at 7 PM to guide participants through the challenge and offer valuable insights.</li></ul><h3>Submission and Evaluation</h3><ul><li><strong>Submission Format:</strong> Participants must submit their predictions in the specified format in the submission.csv file.</li><li><strong>Evaluation Metric:</strong> The models will be evaluated based on their accuracy in predicting the correct categories.</li><li><strong>Leaderboard:</strong> Stay updated on your progress and aim for the top spot on the leaderboard.</li></ul><h3>Data Description</h3><p>The dataset for this hackathon includes the following:</p><ul><li><strong>training.csv:</strong> Contains the training data with news headlines, descriptions, and their respective categories.</li><li><strong>test.csv:</strong> Test data that participants will use to generate predictions.</li><li><strong>submission.csv:</strong> The format for submitting your predictions.</li></ul><h3>How to Crack This Challenge</h3><p>Successfully cracking this challenge involves leveraging natural language processing techniques and machine learning algorithms. Here are some steps to get started:</p><ol><li><strong>Data Preprocessing:</strong> Clean the text data by removing stop words, and punctuation, and applying techniques like stemming or lemmatization.</li><li><strong>Feature Extraction:</strong> Convert the text data into numerical features using methods like TF-IDF or word embeddings.</li><li><strong>Model Selection:</strong> Experiment with various classification models such as Logistic Regression, Support Vector Machines, or advanced models like BERT.</li><li><strong>Evaluation and Tuning:</strong> Evaluate your models using cross-validation and tune hyperparameters to improve performance.</li></ol><p>For our subscribers, a starter notebook is available to help you kickstart your solution. This notebook provides a basic framework for data preprocessing and model building, which you can further enhance and customize.</p><h3>Getting Started</h3><ul><li><strong>Register Now:</strong> Ensure you are registered for the event to participate and receive updates.</li><li><strong>Download the Dataset:</strong> Access the dataset from the MachineHack platform to begin working on your solution.</li><li><strong>Join the Community:</strong> Engage with fellow participants and mentors through our Telegram group for support and collaboration.</li></ul><h3>Support and Resources</h3><p>For any questions or assistance, feel free to contact the support team at support@machinehack.com. Stay updated with the latest information and announcements by subscribing to our newsletter.</p><p>Happy Hacking and Growing! 🚀</p>
Key Information
- Category: Hackathon
- Difficulty Level: Intermediate
- Status: Expired
- Start Date: 2024-07-18T21:20:00Z
- End Date: 2024-08-11T23:23:59Z
- Current Participants: 198
Prizes and Awards
Knowledge
Rules and Guidelines
<ul><li>The participants are required to provide the code for the work done.</li><li>The output of the code should match the submission file with for the "Best Score" achieved by the participant.</li></ul>
Evaluation Criteria
<p>The evaluation will be performed using the <strong>"Accuracy"</strong> metric between the submitted and the result file.</p>
Quick Summary
News Story Category Prediction is a intermediate level hackathon currently expired. It has 198 participants. Prizes include: Knowledge. The event runs from 2024-07-18T21:20:00Z to 2024-08-11T23:23:59Z.Registration is free and open to all skill levels.
