Skip to main content

Data Science Blog

loading ·
Share with :

Welcome to my Data Science Blog!

·1092 words·6 mins· loading
Language Models (LLMs) Business & Career Societal Impact Artificial Intelligence (AI) Career Development AI Capabilities AI Integration
LLM Skills and Human Skills # What is skill? # A skill is the ability to perform a task or activity effectively and efficiently. It involves applying knowledge, experience, and techniques to achieve …
·2785 words·14 mins· loading
Language Models (LLMs) AI/ML Models Software Architecture & Design Language Models (LLMs) Deep Learning (DL) AI Model Training Neural Networks AI Development Machine Learning (ML) Transformer Models
Understanding LLM Architectures and Model Training # Large Language Models (LLMs) are transforming the field of artificial intelligence by enabling machines to understand and generate human language …
·1531 words·8 mins· loading
Language Models (LLMs) AI Ethics & Governance Cybersecurity AI Ethics and Governance Data Security Data Privacy AI Safety ML Security
LLM Security and Ethics Considerations # Question: For my client’s highly secured data like health industry data, banking, insurnace, internal security, etc. data can I use gpt4 for finetuning? …
·2968 words·14 mins· loading
Language Models (LLMs) AI/ML Models Machine Learning (ML) LLM Fine-Tuning Machine Learning (ML) AI Model Training AI Model Customization Language Models (LLMs) Transfer Learning Neural Networks
Finetuning, Fewshot Learning, Why and How? # Why to finetune a LLM? # Fine-tuning a large language model (LLM) can provide several benefits, depending on your specific needs and objectives. Here are …
·1693 words·8 mins· loading
Data Science Programming Computer Science Fundamentals Computer Science Fundamentals Text Processing Software Development Technology Standards Programming Data Fundamentals
What is Unicode and how does it works? # What is Unicode: A Universal Character Set # Unicode is a standard that assigns a unique number to every character, no matter the platform, program, or …
·1033 words·5 mins· loading
AI/ML Models Language Models (LLMs) Technology Trends & Future Language Models (LLMs) Specific AI Models AI Models AI Model Training Open Source AI Machine Learning (ML)
Stanford Alpaca # Introduction # Stanford Alpaca Github Report Stanford Alpaca is An “Instruction-following” LLaMA Model This is the repo aims to build and share an instruction-following …
·1972 words·10 mins· loading
Cybersecurity Software Development Cybersecurity Software Development Data Security
Software Security Concepts # As of today, this article is answering following questions related to software security. In future, I will add more concepts here. What are the major ideas which we need …
·2064 words·10 mins· loading
Language Models (LLMs) Generative AI AI/ML Models Language Models (LLMs) Generative AI Transformer Models Deep Learning (DL) AI Architecture Neural Networks Machine Learning (ML)
Understanding LLM, GAN and Transformers # LLM Layers # Large Language Models (LLMs) are typically based on Transformer architectures, which consist of several types of layers that work together to …
·7314 words·35 mins· loading
AI/ML Models Artificial Intelligence (AI) Transformer Models Deep Learning (DL) Natural Language Processing (NLP) Neural Networks Machine Learning (ML) AI Architecture Transformer Architecture
Transformers Demystified A Step-by-Step Guide # All modern Transformers are based on a paper “Attention is all you need” Introduction # This was the mother paper of all the transformer …
·2786 words·14 mins· loading
Data Analysis & Visualization Machine Learning (ML) Data Science Data Science Machine Learning (ML) Data Visualization Dimensionality Reduction Data Analysis
Dimensionality Reduction and Visualization # What are the popular methods of dimensionality reduction? # Dimensionality reduction is a crucial step in data preprocessing, particularly when dealing …
·712 words·4 mins· loading
Databases Cloud Computing DevOps & MLOps Serverless Computing Cloud Databases Database Management Cloud Computing Data Storage Cloud Architecture Databases
Serverless databases # A serverless database is a type of database service that automatically manages infrastructure and scaling, allowing developers to focus solely on building applications without …
·1636 words·8 mins· loading
Healthcare Technology AI Applications Healthcare AI Healthcare Technology
AI in Health Care # Human health is paramount for inviduals, governments, hospitals and other related systems. There are powerful laws and heavy panelty for the violations for these laws. In the …
·1346 words·7 mins· loading
Cybersecurity Algorithms Computer Science Fundamentals Cybersecurity Data Security Data Quality
All about Hashing # What is Hashing function? # A hashing function is a mathematical algorithm that converts an input (or “message”) into a fixed-size string of bytes, typically a hash …
·1074 words·6 mins· loading
Containerization Software Development Docker Containerization DevOps Development Tools Deployment
Creating Docker Image # What is Docker? # Docker is an open platform for developing, shipping, and running applications. Docker enables you to separate your applications from your infrastructure so …
·2404 words·12 mins· loading
API Development Web Development API Development Web Development Web Services Software Architecture Backend Development Networking
REST API # An API (Application Programming Interface) is a set of rules and protocols that allows one software application to interact with another. It defines the methods and data formats that …
·1589 words·8 mins· loading
Natural Language Processing (NLP) AI/ML Research & Evaluation Evaluation & Metrics Natural Language Processing (NLP) Machine Learning (ML) AI Benchmarks Language Models (LLMs) Evaluation NLP Evaluation AI Model Evaluation
NLP BenchMarks # What is Language Model? # A language model is a computational model that understands and generates human language. It learns the patterns and structure of a language by analyzing …
·1122 words·6 mins· loading
IT Infrastructure System Administration Operating Systems System Administration File Management Software Development
Decoding Windows User Folder # My machine c:\users folder has 3 users Detault, harip, public. What is the purpose of these 3 users? # The C:\Users folder on a Windows machine contains subfolders for …
·2871 words·14 mins· loading
Python Programming Development Environment & Tools Python Software Development Tools Development Tools Development Environment Software Development
Decoding pip install operations # Your draft provides useful insights into using pip for Python package management. Here’s a refined version of your article with improved structure, grammar, …
·4484 words·22 mins· loading
Containerization Development Environment & Tools Programming Docker Containerization DevOps Software Development Development Tools System Administration
Decoding docker commands # Is this article for me? # If you are coming from IT Infrastructure background and have solid experience in containerization you can skip this. But if you are seeking to …
·24 words·1 min· loading
Data Science Natural Language Processing (NLP) Interdisciplinary Topics Corpus AI Use Cases Literature Sanskrit NLP Concepts Digital Humanities Ancient Texts
Manmath Nath - Ramayana Corpus # Corpus Introduction Corpus License Bala Kanda Ayodhya Kanda Aranyaka Kanda Kishkinddha Kanda Sunder Kanda Lanka Kanda Utter Kanda
·133 words·1 min· loading
Data Science Natural Language Processing (NLP) Interdisciplinary Topics Corpus Literature Sanskrit NLP Concepts Digital Humanities Ancient Texts
KM Ganguli Mahabharat Corpus # Adi Parva (The Book of the Beginning) Sabha Parva (The Book of the Assembly Hall) Vana Parva or Aranyaka-Parva (The Book of the Forest) Virata Parva (The Book of …
·547 words·3 mins· loading
AI Applications Industry Applications AI in Government Public Services Digital Transformation Urban Technology E-Governance
AI Usecases in Government # Preliminary Work before any AI Project with Government # Keeping technology evolution, cost of latest technologies, government expectations in mind one should do following …
·2044 words·10 mins· loading
Education Technology AI Applications AI in Education Education Technology
AI in School Education # Introduction # In the ever-evolving landscape of education, a technological revolution is quietly reshaping the way students learn, teachers instruct, and schools operate. At …
·3128 words·15 mins· loading
Data Science Astronomy Ancient Texts Predictive Analytics
Basics of Jyotish # Introduction # In this article I am going to discuss the basics of astrology and the data science aspect of astrology. I am defending here a big case of model retraining and …
·116 words·1 min· loading
Philosophy & Cognitive Science Learning Resources Business & Career Personal Development Learning Resources
Download Link to this Diary The Inspirational Leader by Gifford Thomas The 5 Elements of Effective Thinking by Edward B. Burger How to Listen by Trimboli Self-discipline in 10 Days by Theodore Bryant …
·949 words·5 mins· loading
Artificial Intelligence (AI) Natural Language Processing (NLP) AI Applications Natural Language Processing (NLP) Language Models (LLMs) Text Analysis Machine Learning (ML) NLP Applications Text Processing
Empowering-Language-with-AI-NLP-Capabilities # Introduction # When envisioning artificial intelligence (AI), the initial images that often come to mind are humanoid robots. However, this perception …
·225 words·2 mins· loading
Natural Language Processing (NLP) Data Analysis & Visualization AI/ML Models Language Models (LLMs) NLP Applications Text Analysis Natural Language Processing (NLP) Machine Learning (ML) Text Mining
Topic Modeling with BERT # Key steps in BERTopic modelling are as following. Use “Sentence Embedding” models to embed the sentences of the article Reduce the dimensionality of embedding …
·585 words·3 mins· loading
Prompt Engineering Artificial Intelligence (AI) Prompt Engineering Language Models (LLMs) AI Reasoning Neural Networks Machine Learning (ML) Cognitive Computing Artificial Intelligence (AI)
Graph of Thoughts # This is a valuable resource for learning Graph of Thoughts (GoT) concepts. The YouTube video is from code_your_own_AI. I’m utilizing the comments made by @wesleychang2005 on …
·2238 words·11 mins· loading
Natural Language Processing (NLP) AI/ML Models Data Science NLP Concepts Natural Language Processing (NLP) Data Representation Text Processing Machine Learning (ML) Neural Networks Language Models (LLMs)
Basics of Word Embedding # What is Context, target and window? # The “context” word is the surrounding word. The “target” word is the middle word. The “window …
·5445 words·26 mins· loading
Research & Academia Business & Career Research and Academia Data Science Education AI Research Research Methods Technical Writing Education Technology
My Journey from Master to PhD in Data Science and AI # I have been in software development between 1993 to 2009. Some of these years were in senior leadership roles in delivery management, project …
·4066 words·20 mins· loading
Language Models (LLMs) AI Hardware & Infrastructure AI/ML Models Language Models (LLMs) AI Model Optimization Deep Learning (DL) AI Model Architecture
Compressing Large Language Model # Is this article for me? # If you are looking answers to following question then “Yes” What is LLM compression? Why is LLM compression necessary? What …
·1143 words·6 mins· loading
Software Development Development Environment & Tools Learning Resources Technical Writing Research Tools Research and Academia
In the realm of document typesetting and preparation, LaTeX stands as a timeless giant, revered by professionals, researchers, students, and publishers alike. With its unmatched typographic quality, …
·1483 words·7 mins· loading
Databases AI/ML Models Data Science Vector Databases Machine Learning Concepts Information Retrieval Language Models (LLMs) Machine Learning (ML) Data Storage
What is pinecone? # Pinecone is a managed vector database that provides vector search (or “similarity search”) for developers with a straightforward API and usage-based pricing. It’s free to try. …
·296 words·2 mins· loading
Machine Learning (ML) Development Environment & Tools Software Development Machine Learning Development ML Frameworks Deep Learning (DL) AI Model Architecture MLOps Software Engineering Best Practices
ML Model Development Framework & Model Repositories # Introduction # There are hundreds of machine learning tasks. To do these tasks there are thousands of datasets created by individuals, …
·3697 words·18 mins· loading
Machine Learning (ML) AI/ML Models DevOps & MLOps Machine Learning Models AI Platforms Transfer Learning Deep Learning (DL) Computer Vision Natural Language Processing (NLP) AI Model Deployment AI Resources
ML Model Repository from Pinto0309 # Introduction # Using AI we can solve many kinds of tasks for this input can be text, structured data, image, video, audio, time-series, etc. To solve these …
·386 words·2 mins· loading
Python API Development Programming Python API Development Data Integration Web Services Programming Data Collection Development Tools
Python APIs for Data # Bing Bing is a search engine that brings together the best of search and people in your social networks to help you spend less time searching and more time doing. Api …
·1034 words·5 mins· loading
Machine Learning (ML) Data Analysis & Visualization Mathematics Machine Learning (ML) Mathematics for AI Data Analysis Mathematics Machine Learning Algorithms Statistical Methods
Distances in Machine Learning # Every sample, record, word, sentence, object, image etc in the Machine learning language is called vector. If we want to measure the similarity or dissimilarity …
·5601 words·27 mins· loading
AI/ML Research & Evaluation Learning Resources Research Methods ML Frameworks Coding Resources AI Development Machine Learning (ML) Deep Learning (DL) Computer Vision Research Papers
Paper with Code Resources # Trending Papers of 2021 # ADOP: Approximate Differentiable One-Pixel Point Rendering — Rückert et al — …
·5247 words·25 mins· loading
Artificial Intelligence (AI) AI/ML Research & Evaluation Data Science Resources Language Models (LLMs) Transformer Architecture Neural Network Architecture Research Papers AI Research Machine Learning (ML) Deep Learning (DL) Learning Resources AI Development Research Resources
Important AI Paper List # Introduciton # In almost all citations it becomes very difficult to read the title of research papers. Why? Because the contributors’ information is first and most of …
·4249 words·20 mins· loading
Machine Learning (ML) Evaluation & Metrics Machine Learning (ML) AI Model Evaluation Evaluation Metrics Data Science Machine Learning Metrics Statistical Analysis
Machine Learning Metrics # Introduction # In Machine Learning projects whether classical machine learning, deep learning, computer vision, speech processing, NLP, or any other ML project we keep …
·3295 words·16 mins· loading
Language Models (LLMs) Learning Resources Artificial Intelligence (AI) Language Models (LLMs) Deep Learning (DL) Natural Language Processing (NLP) Computer Vision AI Fundamentals Machine Learning (ML) Technical Writing
Comprehensive Glossary of LLM # I am developing this Glossary slowly at my own pace. Content on this page keep changing. Better definition, better explaination are part of my learing, my evolution …
·4096 words·20 mins· loading
Language Models (LLMs) Natural Language Processing (NLP) Artificial Intelligence (AI) Language Models (LLMs) Specific AI Models Natural Language Processing (NLP) AI Models Transformer Architecture Deep Learning (DL) Machine Learning (ML)
What is Large Language Model # Introduction # LLM stands for Large Language Model. It is a type of artificial intelligence (AI) model that is trained on a massive dataset of text and code. This …
·1772 words·9 mins· loading
Language Models (LLMs) Generative AI AI/ML Research & Evaluation AI Research Papers Natural Language Processing (NLP) Research Papers Generative AI Transfer Learning Machine Learning (ML) Language Models (LLMs) AI Research
Paper Name :- Pretrained Language Models for Text Generation: A Survey Typer of Paper:- Survey Paper Paper URL Paper title of the citations mentioned can be found at AI Papers with Heading. Use …
·1041 words·5 mins· loading
Research & Academia Learning Resources Research Methods Technical Writing Research Papers Research and Academia
How to Conduct Literature Review? # Introduction # Literature Review (LR) or Literature Survey (LS) is a process that helps you to browse the libraries, literature, articles, books, conference …
·9200 words·44 mins· loading
Natural Language Processing (NLP) Natural Language Processing (NLP) Machine Learning (ML) Text Analysis Language Models (LLMs) NLP Research Artificial Intelligence (AI)
NLP Tasks # Introduction # Processing words of any language and driving some meaning from these is as old as the human language. Recently, AI momentum is taking on many of these language-processing …
·525 words·3 mins· loading
Databases Mathematics SQL Databases Databases Database Theory Data Management
SQL and Relational Algebra # Relational algebra (RA) is considered as a procedural query language where the user tells the system to carry out a set of operations to obtain the desired results. i.e. …
·1231 words·6 mins· loading
Communication Research Methods Critical Thinking Career Development
Types of Questions # Introduction # Question-Answering task is one of the tasks in NLP-Task. To create a high-performing AI system that can understand the question correctly and answer appropriately, …
·2525 words·12 mins· loading
Cloud Computing API Development GCP Cloud API Development Cloud Computing Cloud Services Development Environment Cloud Integration Cloud Development
Google Cloud APIs # Introduction # Hundreds of services from Google are available to consumers as API. Every API has a specific purpose. Over a period of time, google keeps clubbing these API …
·739 words·4 mins· loading
AI Hardware & Infrastructure AI/ML Models Cloud Computing GCP Cloud Google AI Platform Machine Learning (ML) AI Model Optimization Cloud AI Services MLOps
Tuning Large Language Model with VertexAI # Why Model Tuning? # Tuning is required when you want the model to learn something niche or specific that deviates from general language patterns. Goal of …
·2155 words·11 mins· loading
Prompt Engineering Artificial Intelligence (AI) Natural Language Processing (NLP) Language Models (LLMs) Prompt Engineering Artificial Intelligence (AI) Natural Language Processing (NLP) Specific AI Models Human-AI Interaction
Introduction to Prompt Best Engineering # Prompts can contain questions, instructions, contextual information, examples, and partial input for the model to complete or continue. After the model …