Data Science Blog
loading
·
Welcome to my Data Science Blog!
·1092 words·6 mins·
loading
Language Models (LLMs)
Business & Career
Societal Impact
Artificial Intelligence (AI)
Career Development
AI Capabilities
AI Integration
LLM Skills and Human Skills # What is skill? # A skill is the ability to perform a task or activity effectively and efficiently. It involves applying knowledge, experience, and techniques to achieve …
·2785 words·14 mins·
loading
Language Models (LLMs)
AI/ML Models
Software Architecture & Design
Language Models (LLMs)
Deep Learning (DL)
AI Model Training
Neural Networks
AI Development
Machine Learning (ML)
Transformer Models
Understanding LLM Architectures and Model Training # Large Language Models (LLMs) are transforming the field of artificial intelligence by enabling machines to understand and generate human language …
·1531 words·8 mins·
loading
Language Models (LLMs)
AI Ethics & Governance
Cybersecurity
AI Ethics and Governance
Data Security
Data Privacy
AI Safety
ML Security
LLM Security and Ethics Considerations # Question: For my client’s highly secured data like health industry data, banking, insurnace, internal security, etc. data can I use gpt4 for finetuning? …
·2968 words·14 mins·
loading
Language Models (LLMs)
AI/ML Models
Machine Learning (ML)
LLM Fine-Tuning
Machine Learning (ML)
AI Model Training
AI Model Customization
Language Models (LLMs)
Transfer Learning
Neural Networks
Finetuning, Fewshot Learning, Why and How? # Why to finetune a LLM? # Fine-tuning a large language model (LLM) can provide several benefits, depending on your specific needs and objectives. Here are …
·1693 words·8 mins·
loading
Data Science
Programming
Computer Science Fundamentals
Computer Science Fundamentals
Text Processing
Software Development
Technology Standards
Programming
Data Fundamentals
What is Unicode and how does it works? # What is Unicode: A Universal Character Set # Unicode is a standard that assigns a unique number to every character, no matter the platform, program, or …
·1033 words·5 mins·
loading
AI/ML Models
Language Models (LLMs)
Technology Trends & Future
Language Models (LLMs)
Specific AI Models
AI Models
AI Model Training
Open Source AI
Machine Learning (ML)
Stanford Alpaca # Introduction # Stanford Alpaca Github Report
Stanford Alpaca is An “Instruction-following” LLaMA Model This is the repo aims to build and share an instruction-following …
·1972 words·10 mins·
loading
Cybersecurity
Software Development
Cybersecurity
Software Development
Data Security
Software Security Concepts # As of today, this article is answering following questions related to software security. In future, I will add more concepts here.
What are the major ideas which we need …
·2064 words·10 mins·
loading
Language Models (LLMs)
Generative AI
AI/ML Models
Language Models (LLMs)
Generative AI
Transformer Models
Deep Learning (DL)
AI Architecture
Neural Networks
Machine Learning (ML)
Understanding LLM, GAN and Transformers # LLM Layers # Large Language Models (LLMs) are typically based on Transformer architectures, which consist of several types of layers that work together to …
·7314 words·35 mins·
loading
AI/ML Models
Artificial Intelligence (AI)
Transformer Models
Deep Learning (DL)
Natural Language Processing (NLP)
Neural Networks
Machine Learning (ML)
AI Architecture
Transformer Architecture
Transformers Demystified A Step-by-Step Guide # All modern Transformers are based on a paper “Attention is all you need”
Introduction # This was the mother paper of all the transformer …
·2786 words·14 mins·
loading
Data Analysis & Visualization
Machine Learning (ML)
Data Science
Data Science
Machine Learning (ML)
Data Visualization
Dimensionality Reduction
Data Analysis
Dimensionality Reduction and Visualization # What are the popular methods of dimensionality reduction? # Dimensionality reduction is a crucial step in data preprocessing, particularly when dealing …
·712 words·4 mins·
loading
Databases
Cloud Computing
DevOps & MLOps
Serverless Computing
Cloud Databases
Database Management
Cloud Computing
Data Storage
Cloud Architecture
Databases
Serverless databases # A serverless database is a type of database service that automatically manages infrastructure and scaling, allowing developers to focus solely on building applications without …
·1636 words·8 mins·
loading
Healthcare Technology
AI Applications
Healthcare AI
Healthcare Technology
AI in Health Care # Human health is paramount for inviduals, governments, hospitals and other related systems. There are powerful laws and heavy panelty for the violations for these laws. In the …
·1346 words·7 mins·
loading
Cybersecurity
Algorithms
Computer Science Fundamentals
Cybersecurity
Data Security
Data Quality
All about Hashing # What is Hashing function? # A hashing function is a mathematical algorithm that converts an input (or “message”) into a fixed-size string of bytes, typically a hash …
·1074 words·6 mins·
loading
Containerization
Software Development
Docker
Containerization
DevOps
Development Tools
Deployment
Creating Docker Image # What is Docker? # Docker is an open platform for developing, shipping, and running applications. Docker enables you to separate your applications from your infrastructure so …
·2404 words·12 mins·
loading
API Development
Web Development
API Development
Web Development
Web Services
Software Architecture
Backend Development
Networking
REST API # An API (Application Programming Interface) is a set of rules and protocols that allows one software application to interact with another. It defines the methods and data formats that …
·1589 words·8 mins·
loading
Natural Language Processing (NLP)
AI/ML Research & Evaluation
Evaluation & Metrics
Natural Language Processing (NLP)
Machine Learning (ML)
AI Benchmarks
Language Models (LLMs)
Evaluation
NLP Evaluation
AI Model Evaluation
NLP BenchMarks # What is Language Model? # A language model is a computational model that understands and generates human language. It learns the patterns and structure of a language by analyzing …
·1122 words·6 mins·
loading
IT Infrastructure
System Administration
Operating Systems
System Administration
File Management
Software Development
Decoding Windows User Folder # My machine c:\users folder has 3 users Detault, harip, public. What is the purpose of these 3 users? # The C:\Users folder on a Windows machine contains subfolders for …
·2871 words·14 mins·
loading
Python
Programming
Development Environment & Tools
Python
Software Development Tools
Development Tools
Development Environment
Software Development
Decoding pip install operations # Your draft provides useful insights into using pip for Python package management. Here’s a refined version of your article with improved structure, grammar, …
·4484 words·22 mins·
loading
Containerization
Development Environment & Tools
Programming
Docker
Containerization
DevOps
Software Development
Development Tools
System Administration
Decoding docker commands # Is this article for me? # If you are coming from IT Infrastructure background and have solid experience in containerization you can skip this. But if you are seeking to …
·24 words·1 min·
loading
Data Science
Natural Language Processing (NLP)
Interdisciplinary Topics
Corpus
AI Use Cases
Literature
Sanskrit
NLP Concepts
Digital Humanities
Ancient Texts
Manmath Nath - Ramayana Corpus # Corpus Introduction Corpus License Bala Kanda Ayodhya Kanda Aranyaka Kanda Kishkinddha Kanda Sunder Kanda Lanka Kanda Utter Kanda
·133 words·1 min·
loading
Data Science
Natural Language Processing (NLP)
Interdisciplinary Topics
Corpus
Literature
Sanskrit
NLP Concepts
Digital Humanities
Ancient Texts
KM Ganguli Mahabharat Corpus # Adi Parva (The Book of the Beginning) Sabha Parva (The Book of the Assembly Hall) Vana Parva or Aranyaka-Parva (The Book of the Forest) Virata Parva (The Book of …
·547 words·3 mins·
loading
AI Applications
Industry Applications
AI in Government
Public Services
Digital Transformation
Urban Technology
E-Governance
AI Usecases in Government # Preliminary Work before any AI Project with Government # Keeping technology evolution, cost of latest technologies, government expectations in mind one should do following …
·2044 words·10 mins·
loading
Education Technology
AI Applications
AI in Education
Education Technology
AI in School Education # Introduction # In the ever-evolving landscape of education, a technological revolution is quietly reshaping the way students learn, teachers instruct, and schools operate. At …
·3128 words·15 mins·
loading
Data Science
Astronomy
Ancient Texts
Predictive Analytics
Basics of Jyotish # Introduction # In this article I am going to discuss the basics of astrology and the data science aspect of astrology. I am defending here a big case of model retraining and …
·116 words·1 min·
loading
Philosophy & Cognitive Science
Learning Resources
Business & Career
Personal Development
Learning Resources
Download Link to this Diary
The Inspirational Leader by Gifford Thomas The 5 Elements of Effective Thinking by Edward B. Burger How to Listen by Trimboli Self-discipline in 10 Days by Theodore Bryant …
·949 words·5 mins·
loading
Artificial Intelligence (AI)
Natural Language Processing (NLP)
AI Applications
Natural Language Processing (NLP)
Language Models (LLMs)
Text Analysis
Machine Learning (ML)
NLP Applications
Text Processing
Empowering-Language-with-AI-NLP-Capabilities # Introduction # When envisioning artificial intelligence (AI), the initial images that often come to mind are humanoid robots. However, this perception …
·225 words·2 mins·
loading
Natural Language Processing (NLP)
Data Analysis & Visualization
AI/ML Models
Language Models (LLMs)
NLP Applications
Text Analysis
Natural Language Processing (NLP)
Machine Learning (ML)
Text Mining
Topic Modeling with BERT # Key steps in BERTopic modelling are as following.
Use “Sentence Embedding” models to embed the sentences of the article Reduce the dimensionality of embedding …
·585 words·3 mins·
loading
Prompt Engineering
Artificial Intelligence (AI)
Prompt Engineering
Language Models (LLMs)
AI Reasoning
Neural Networks
Machine Learning (ML)
Cognitive Computing
Artificial Intelligence (AI)
Graph of Thoughts # This is a valuable resource for learning Graph of Thoughts (GoT) concepts. The YouTube video is from code_your_own_AI. I’m utilizing the comments made by @wesleychang2005 on …
·2238 words·11 mins·
loading
Natural Language Processing (NLP)
AI/ML Models
Data Science
NLP Concepts
Natural Language Processing (NLP)
Data Representation
Text Processing
Machine Learning (ML)
Neural Networks
Language Models (LLMs)
Basics of Word Embedding # What is Context, target and window? # The “context” word is the surrounding word. The “target” word is the middle word. The “window …
·5445 words·26 mins·
loading
Research & Academia
Business & Career
Research and Academia
Data Science Education
AI Research
Research Methods
Technical Writing
Education Technology
My Journey from Master to PhD in Data Science and AI # I have been in software development between 1993 to 2009. Some of these years were in senior leadership roles in delivery management, project …
·4066 words·20 mins·
loading
Language Models (LLMs)
AI Hardware & Infrastructure
AI/ML Models
Language Models (LLMs)
AI Model Optimization
Deep Learning (DL)
AI Model Architecture
Compressing Large Language Model # Is this article for me? # If you are looking answers to following question then “Yes”
What is LLM compression? Why is LLM compression necessary? What …
·1143 words·6 mins·
loading
Software Development
Development Environment & Tools
Learning Resources
Technical Writing
Research Tools
Research and Academia
In the realm of document typesetting and preparation, LaTeX stands as a timeless giant, revered by professionals, researchers, students, and publishers alike. With its unmatched typographic quality, …
·1483 words·7 mins·
loading
Databases
AI/ML Models
Data Science
Vector Databases
Machine Learning Concepts
Information Retrieval
Language Models (LLMs)
Machine Learning (ML)
Data Storage
What is pinecone? # Pinecone is a managed vector database that provides vector search (or “similarity search”) for developers with a straightforward API and usage-based pricing. It’s free to try. …
·296 words·2 mins·
loading
Machine Learning (ML)
Development Environment & Tools
Software Development
Machine Learning Development
ML Frameworks
Deep Learning (DL)
AI Model Architecture
MLOps
Software Engineering
Best Practices
ML Model Development Framework & Model Repositories # Introduction # There are hundreds of machine learning tasks. To do these tasks there are thousands of datasets created by individuals, …
·3697 words·18 mins·
loading
Machine Learning (ML)
AI/ML Models
DevOps & MLOps
Machine Learning Models
AI Platforms
Transfer Learning
Deep Learning (DL)
Computer Vision
Natural Language Processing (NLP)
AI Model Deployment
AI Resources
ML Model Repository from Pinto0309 # Introduction # Using AI we can solve many kinds of tasks for this input can be text, structured data, image, video, audio, time-series, etc. To solve these …
·386 words·2 mins·
loading
Python
API Development
Programming
Python
API Development
Data Integration
Web Services
Programming
Data Collection
Development Tools
Python APIs for Data # Bing Bing is a search engine that brings together the best of search and people in your social networks to help you spend less time searching and more time doing.
Api …
·1034 words·5 mins·
loading
Machine Learning (ML)
Data Analysis & Visualization
Mathematics
Machine Learning (ML)
Mathematics for AI
Data Analysis
Mathematics
Machine Learning Algorithms
Statistical Methods
Distances in Machine Learning # Every sample, record, word, sentence, object, image etc in the Machine learning language is called vector. If we want to measure the similarity or dissimilarity …
·5601 words·27 mins·
loading
AI/ML Research & Evaluation
Learning Resources
Research Methods
ML Frameworks
Coding Resources
AI Development
Machine Learning (ML)
Deep Learning (DL)
Computer Vision
Research Papers
Paper with Code Resources # Trending Papers of 2021 # ADOP: Approximate Differentiable One-Pixel Point Rendering — Rückert et al — …
·5247 words·25 mins·
loading
Artificial Intelligence (AI)
AI/ML Research & Evaluation
Data Science Resources
Language Models (LLMs)
Transformer Architecture
Neural Network Architecture
Research Papers
AI Research
Machine Learning (ML)
Deep Learning (DL)
Learning Resources
AI Development
Research Resources
Important AI Paper List # Introduciton # In almost all citations it becomes very difficult to read the title of research papers. Why? Because the contributors’ information is first and most of …
·4249 words·20 mins·
loading
Machine Learning (ML)
Evaluation & Metrics
Machine Learning (ML)
AI Model Evaluation
Evaluation Metrics
Data Science
Machine Learning Metrics
Statistical Analysis
Machine Learning Metrics # Introduction # In Machine Learning projects whether classical machine learning, deep learning, computer vision, speech processing, NLP, or any other ML project we keep …
·3295 words·16 mins·
loading
Language Models (LLMs)
Learning Resources
Artificial Intelligence (AI)
Language Models (LLMs)
Deep Learning (DL)
Natural Language Processing (NLP)
Computer Vision
AI Fundamentals
Machine Learning (ML)
Technical Writing
Comprehensive Glossary of LLM # I am developing this Glossary slowly at my own pace. Content on this page keep changing. Better definition, better explaination are part of my learing, my evolution …
·4096 words·20 mins·
loading
Language Models (LLMs)
Natural Language Processing (NLP)
Artificial Intelligence (AI)
Language Models (LLMs)
Specific AI Models
Natural Language Processing (NLP)
AI Models
Transformer Architecture
Deep Learning (DL)
Machine Learning (ML)
What is Large Language Model # Introduction # LLM stands for Large Language Model. It is a type of artificial intelligence (AI) model that is trained on a massive dataset of text and code. This …
·1772 words·9 mins·
loading
Language Models (LLMs)
Generative AI
AI/ML Research & Evaluation
AI Research Papers
Natural Language Processing (NLP)
Research Papers
Generative AI
Transfer Learning
Machine Learning (ML)
Language Models (LLMs)
AI Research
Paper Name :- Pretrained Language Models for Text Generation: A Survey
Typer of Paper:- Survey Paper
Paper URL
Paper title of the citations mentioned can be found at AI Papers with Heading. Use …
·1041 words·5 mins·
loading
Research & Academia
Learning Resources
Research Methods
Technical Writing
Research Papers
Research and Academia
How to Conduct Literature Review? # Introduction # Literature Review (LR) or Literature Survey (LS) is a process that helps you to browse the libraries, literature, articles, books, conference …
·9200 words·44 mins·
loading
Natural Language Processing (NLP)
Natural Language Processing (NLP)
Machine Learning (ML)
Text Analysis
Language Models (LLMs)
NLP Research
Artificial Intelligence (AI)
NLP Tasks # Introduction # Processing words of any language and driving some meaning from these is as old as the human language. Recently, AI momentum is taking on many of these language-processing …
·525 words·3 mins·
loading
Databases
Mathematics
SQL Databases
Databases
Database Theory
Data Management
SQL and Relational Algebra # Relational algebra (RA) is considered as a procedural query language where the user tells the system to carry out a set of operations to obtain the desired results. i.e. …
·1231 words·6 mins·
loading
Communication
Research Methods
Critical Thinking
Career Development
Types of Questions # Introduction # Question-Answering task is one of the tasks in NLP-Task. To create a high-performing AI system that can understand the question correctly and answer appropriately, …
·2525 words·12 mins·
loading
Cloud Computing
API Development
GCP Cloud
API Development
Cloud Computing
Cloud Services
Development Environment
Cloud Integration
Cloud Development
Google Cloud APIs # Introduction # Hundreds of services from Google are available to consumers as API. Every API has a specific purpose. Over a period of time, google keeps clubbing these API …
·739 words·4 mins·
loading
AI Hardware & Infrastructure
AI/ML Models
Cloud Computing
GCP Cloud
Google AI Platform
Machine Learning (ML)
AI Model Optimization
Cloud AI Services
MLOps
Tuning Large Language Model with VertexAI # Why Model Tuning? # Tuning is required when you want the model to learn something niche or specific that deviates from general language patterns.
Goal of …
·2155 words·11 mins·
loading
Prompt Engineering
Artificial Intelligence (AI)
Natural Language Processing (NLP)
Language Models (LLMs)
Prompt Engineering
Artificial Intelligence (AI)
Natural Language Processing (NLP)
Specific AI Models
Human-AI Interaction
Introduction to Prompt Best Engineering # Prompts can contain questions, instructions, contextual information, examples, and partial input for the model to complete or continue. After the model …