AI Model Optimization Part-1 by Applied Mathematician


AI Model Optimization is an interdisciplinary field in which researchers, systems engineers, and hardware architects collaborate to make models better, cheaper, and faster: maximizing throughput while minimizing latency and computational cost.

Table of Contents

Large Scale AI Models

What is an AI model?

What slows an AI model down the most? Every AI engineer encounters two bottlenecks imposed by hardware constraints:

  1. Compute Bound: How many mathematical operations can be performed per second (FLOPS)? Example: the prefill phase of an LLM is compute-intensive, requiring high-dimensional matrix multiplications.
  2. Memory Bandwidth Bound: How fast can data move between memory and the processor? Example: the decode phase of an LLM generates tokens one at a time, autoregressively, so the model weights must be loaded repeatedly.

Both bottlenecks work against our goal of maximizing throughput and minimizing latency and computational cost for LLMs.
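A quick way to see which bottleneck applies is arithmetic intensity (FLOPs per byte of data moved): if it is low relative to the hardware's FLOPS-to-bandwidth ratio, the workload is memory-bandwidth bound. Below is a minimal sketch with illustrative shapes (the dimensions and FP16 assumption are mine, not from a specific model):

```python
def matmul_intensity(m: int, n: int, k: int, bytes_per_elem: int = 2) -> float:
    """Arithmetic intensity (FLOPs/byte) of an (m x k) @ (k x n) matmul.

    Assumes FP16 operands (2 bytes) and counts a multiply-accumulate
    as 2 FLOPs; ignores caching, so this is only a back-of-envelope figure.
    """
    flops = 2 * m * n * k
    bytes_moved = bytes_per_elem * (m * k + k * n + m * n)  # read A, B; write C
    return flops / bytes_moved

# Prefill: many tokens processed at once -> large m -> high intensity
# (compute bound).
prefill = matmul_intensity(m=2048, n=4096, k=4096)
# Decode: one token at a time -> m = 1 -> low intensity (memory bound).
decode = matmul_intensity(m=1, n=4096, k=4096)
print(f"prefill ~ {prefill:.0f} FLOPs/byte, decode ~ {decode:.2f} FLOPs/byte")
```

With these numbers, prefill lands around 1024 FLOPs/byte while decode sits near 1 FLOP/byte, which is why decode is dominated by how fast the weights can be streamed from memory.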

We need to understand the hardware requirements of our LLMs (and vice versa): counting an LLM's parameters and estimating its VRAM requirements ensures the model fits within the physical constraints of the hardware.

How do we calculate the total number of trainable parameters (the size) of an LLM?
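For a GPT-style decoder, a standard back-of-envelope count is 12·d² parameters per layer (4·d² for the attention projections plus 8·d² for the MLP, with the usual 4× hidden expansion) plus the token embeddings. A minimal sketch, with the formula and the example shape being my assumptions rather than an exact accounting:

```python
def transformer_params(n_layers: int, d_model: int, vocab_size: int,
                       ffn_mult: int = 4) -> int:
    """Approximate trainable-parameter count for a GPT-style decoder.

    Per layer: 4*d^2 for the Q, K, V, and output projections
             + 2*ffn_mult*d^2 for the MLP up/down projections.
    Token embeddings add vocab_size * d; biases and LayerNorm weights
    are ignored since they are negligible at scale.
    """
    per_layer = 4 * d_model**2 + 2 * ffn_mult * d_model**2
    return n_layers * per_layer + vocab_size * d_model

# Illustrative GPT-2-small-like shape: 12 layers, d_model=768, 50257 vocab.
print(transformer_params(n_layers=12, d_model=768, vocab_size=50257) / 1e6, "M params")
```

This shape yields roughly 124M parameters, which matches the commonly quoted size of GPT-2 small; for other architectures (grouped-query attention, gated MLPs) the per-layer term changes.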

What is the minimum VRAM needed to load an LLM?
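The floor is simply parameter count times bytes per parameter, plus some headroom for activations and the KV cache. A rough sketch; the 20% overhead factor is an assumed rule of thumb, not a hard figure:

```python
def min_vram_gib(n_params: float, bytes_per_param: float = 2,
                 overhead: float = 1.2) -> float:
    """Rough minimum VRAM (GiB) to load a model for inference.

    bytes_per_param: 4 (FP32), 2 (FP16/BF16), 1 (INT8), 0.5 (INT4).
    overhead: ~20% extra for activations and the KV cache (an assumption;
              actual headroom depends on batch size and context length).
    """
    return n_params * bytes_per_param * overhead / 2**30

# A 7B-parameter model in FP16 needs on the order of:
print(f"{min_vram_gib(7e9):.1f} GiB")
```

By this estimate a 7B model in FP16 wants roughly 15-16 GiB, which is why such models are typically served on 16 GiB+ GPUs or quantized to INT8/INT4 for smaller cards.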

What are the possible AI Model Optimization problems?

  1. How can we reduce the model size without degrading its performance?

  2. How can we optimize the transformer architecture for compute and memory?

  3. How can we utilize distributed systems?

  4. How can we deploy models at scale?

