This repository contains my completed recruitment task, divided into three main sections. Each task has been broken down into subtasks, with detailed instructions and outputs documented in the individual README files within the respective task directories.
This task involved building computer vision and NLP models on the provided datasets for a Kaggle competition. Both tasks were completed successfully.
I developed a CNN model for DeepFake detection. I initially assigned the wrong labels to the images, but corrected this with simple post-prediction logic (inverting the predicted labels), avoiding the need for retraining. I also focused on weight initialization: at first, the model was stuck at 50% validation accuracy with no improvement in loss during training. After experimenting with weight initialization techniques, however, validation accuracy jumped to 95%, confirming how important initialization was in this project.
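The exact initialization scheme used is not recorded here; a common choice for ReLU-based CNNs is He (Kaiming) initialization, which scales weight variance to the layer's fan-in. A minimal NumPy sketch (the layer sizes are illustrative, not those of the actual model):

```python
import numpy as np

def he_init(fan_in, fan_out, seed=0):
    """He initialization: weights drawn from N(0, 2/fan_in), which keeps
    activation variance roughly stable through ReLU layers."""
    rng = np.random.default_rng(seed)
    std = np.sqrt(2.0 / fan_in)
    return rng.normal(0.0, std, size=(fan_in, fan_out))

# Hypothetical dense layer: 512 inputs, 256 outputs
W = he_init(512, 256)
print(W.std())  # close to sqrt(2/512) ≈ 0.0625
```

With a poor initialization (e.g. all zeros or too-large variance), gradients vanish or explode and accuracy can plateau at chance level, which matches the 50% behavior described above.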
For the NLP subtask, I built a Logistic Regression classifier and tuned its hyperparameters with GridSearchCV to improve performance.
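A minimal version of that tuning setup might look like the following; the synthetic dataset and parameter grid here are placeholders, not the actual competition data or search space:

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV

# Synthetic stand-in for the competition dataset
X, y = make_classification(n_samples=300, n_features=20, random_state=42)

# Illustrative grid over regularization strength
param_grid = {"C": [0.01, 0.1, 1, 10]}
search = GridSearchCV(
    LogisticRegression(max_iter=1000),
    param_grid,
    cv=5,
    scoring="accuracy",
)
search.fit(X, y)
print(search.best_params_, round(search.best_score_, 3))
```

GridSearchCV exhaustively cross-validates every grid point, so the chosen `C` is the one with the best mean fold accuracy rather than a single train/test split.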
This task involved developing models for image enhancement, with a comparison of different approaches based on evaluation metrics.
I trained a Variational Autoencoder (VAE) model to enhance underwater images, improving their clarity and quality.
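A VAE is trained on an ELBO-style objective: a reconstruction term plus a KL-divergence term that keeps the latent distribution close to a standard normal. A NumPy sketch of that loss, assuming a diagonal-Gaussian latent and mean-squared-error reconstruction (which may differ from the exact setup used here):

```python
import numpy as np

def vae_loss(x, x_recon, mu, logvar, beta=1.0):
    """ELBO-style VAE loss: MSE reconstruction + KL(q(z|x) || N(0, I))."""
    recon = np.mean((x - x_recon) ** 2)
    # Closed-form KL between N(mu, exp(logvar)) and the standard normal
    kl = -0.5 * np.mean(1 + logvar - mu**2 - np.exp(logvar))
    return recon + beta * kl

x = np.ones((4, 8))
# Perfect reconstruction with a standard-normal latent gives zero loss
print(vae_loss(x, x, np.zeros((4, 2)), np.zeros((4, 2))))  # 0.0
```

The KL term is what distinguishes a VAE from a plain autoencoder: it regularizes the latent space so that sampling from it produces coherent outputs.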
I implemented a Generative Adversarial Network (GAN) model on the MNIST dataset, successfully generating new digit samples as part of this experiment.
For this subtask, I implemented a GAN model on the provided dataset. One key aspect of this project was working with a specified pix2pix loss function, which was fascinating to learn and apply.
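The pix2pix generator objective augments the conditional-GAN loss with an L1 term that pulls outputs toward the ground-truth image. A NumPy sketch of that combined loss (λ = 100 follows the original pix2pix paper; the value actually specified for this task is not recorded here):

```python
import numpy as np

def pix2pix_generator_loss(d_fake, fake, target, lam=100.0):
    """Generator loss from pix2pix: adversarial BCE (wants D(fake) -> 1)
    plus a lambda-weighted L1 distance to the target image."""
    eps = 1e-8
    adv = -np.mean(np.log(d_fake + eps))   # BCE with labels = 1
    l1 = np.mean(np.abs(fake - target))    # L1 reconstruction term
    return adv + lam * l1

fake = np.full((2, 4, 4), 0.5)
target = np.full((2, 4, 4), 0.5)          # perfect match: L1 term is zero
d_fake = np.full((2, 1), 0.9)             # discriminator scores for fakes
print(pix2pix_generator_loss(d_fake, fake, target))
```

The L1 term discourages the blurring and mode-dropping that a purely adversarial loss can produce, which is why pix2pix works well for paired image-to-image tasks like enhancement.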
I implemented a Diffusion Model for image enhancement. The base architecture was provided, and I built on that to enhance underwater images.
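The forward (noising) process of a DDPM-style diffusion model has a closed form, x_t = √(ᾱ_t)·x_0 + √(1−ᾱ_t)·ε, so any timestep can be sampled in one step. A NumPy sketch using a generic linear noise schedule (the schedule and timestep count of the provided architecture may differ):

```python
import numpy as np

T = 1000
betas = np.linspace(1e-4, 0.02, T)        # linear noise schedule (illustrative)
alphas_bar = np.cumprod(1.0 - betas)      # cumulative signal retention

def q_sample(x0, t, rng=None):
    """Sample x_t from q(x_t | x_0) in a single closed-form step."""
    rng = rng or np.random.default_rng(0)
    eps = rng.standard_normal(x0.shape)
    return np.sqrt(alphas_bar[t]) * x0 + np.sqrt(1.0 - alphas_bar[t]) * eps

x0 = np.zeros((8, 8))
x_late = q_sample(x0, T - 1)   # at large t, x_t is nearly pure noise
```

Training then amounts to teaching a network to predict the noise ε from x_t and t, and enhancement runs the learned reverse process from a noisy input.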
In this task, I developed a complete Retrieval-Augmented Generation (RAG) system. One challenge I faced was trying to access the Llama model through an API key, which required access to a gated repository. After realizing this approach would not work, I pulled the Llama 3 model locally using Ollama and integrated it with LangChain. This task was a deep dive into RAG systems, LangChain, and AI agents, and it significantly enhanced my understanding of these technologies.
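At its core, the retrieval step of a RAG system scores document chunks against a query embedding and passes the top matches to the LLM as context. A dependency-free sketch of that step using cosine similarity; the actual system used LangChain with a locally pulled Llama 3 via Ollama, and the random toy embeddings below are placeholders for a real embedding model:

```python
import numpy as np

def cosine(a, b):
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

def retrieve(query_vec, doc_vecs, docs, k=2):
    """Return the k chunks whose embeddings are most similar to the query."""
    scores = [cosine(query_vec, v) for v in doc_vecs]
    ranked = sorted(zip(scores, docs), key=lambda p: p[0], reverse=True)
    return [doc for _, doc in ranked[:k]]

docs = ["chunk about VAEs", "chunk about GANs", "chunk about RAG"]
rng = np.random.default_rng(0)
doc_vecs = [rng.standard_normal(16) for _ in docs]
query_vec = doc_vecs[2] + 0.1 * rng.standard_normal(16)  # query near the RAG chunk
top = retrieve(query_vec, doc_vecs, docs)
print(top[0])  # "chunk about RAG"
```

In the full pipeline, the retrieved chunks are interpolated into the prompt, so the local Llama 3 model answers grounded in the indexed documents rather than from its weights alone.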
This recruitment task provided an incredible learning experience, allowing me to dive into various new topics. I gained practical coding experience and developed real-world models in just a week—an opportunity that would have otherwise taken much longer if I had tackled these topics individually. It also highlighted the importance of applying theoretical knowledge in a hands-on setting.
Thank you for this opportunity!