Generative AI and LLMs: Faculty Development Program at MBM University (CSE Dept), 28-03-2026

Artificial Intelligence (AI) focuses on building systems capable of performing tasks that require human intelligence, such as learning from data, recognizing patterns, understanding natural language, and making decisions. Over time, AI has evolved from rule-based systems to machine learning, deep learning, and most recently generative AI and foundation models, driven by large datasets, powerful GPUs, and improved neural architectures.

Generative AI models learn the underlying data distribution and aim to approximate it in order to generate new, realistic samples. These models are widely used in text generation, image synthesis, code generation, and speech generation. In contrast, discriminative models learn the conditional probability P(y | x) for tasks such as classification and prediction.
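As a minimal sketch of the generative idea, the toy model below fits a character bigram distribution to a small hypothetical corpus (the word list and the `sample` helper are illustrative, not from the original) and then draws new strings from it; real generative models do the same thing at vastly larger scale.

```python
import random
from collections import defaultdict

# Hypothetical toy corpus; the model learns P(next_char | current_char).
corpus = ["data", "date", "gate", "late", "rate"]

counts = defaultdict(lambda: defaultdict(int))
for word in corpus:
    chars = "^" + word + "$"              # ^ = start marker, $ = end marker
    for a, b in zip(chars, chars[1:]):
        counts[a][b] += 1

def sample(rng=random.Random(0)):
    """Draw a new string from the learned bigram distribution."""
    out, c = [], "^"
    while True:
        nxt = rng.choices(list(counts[c]), weights=counts[c].values())[0]
        if nxt == "$":
            return "".join(out)
        out.append(nxt)
        c = nxt

print(sample())  # a new word sampled from the learned distribution
```

Because the model stores a distribution over sequences rather than a decision boundary, it can produce strings that never appeared in the training list, which is exactly the "generate new samples" capability the paragraph describes.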

P(x_1, x_2, \ldots, x_T) = \prod_{t=1}^{T} P(x_t \mid x_{<t})

Language models assign probabilities to sequences of words and are trained using cross-entropy loss to predict the next token effectively. Early sequence models like Recurrent Neural Networks (RNNs) introduced hidden states to capture temporal dependencies but suffered from limitations such as sequential computation, vanishing gradients, and difficulty in modeling long-range dependencies.
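The cross-entropy objective mentioned above can be made concrete with a tiny worked example: given the model's predicted next-token distributions at each step (the vocabulary and probabilities below are hypothetical), the loss is the average negative log-probability assigned to the actual next tokens.

```python
import math

# Hypothetical true next tokens at two prediction steps.
targets = ["cat", "sat"]

# Hypothetical model-predicted next-token distributions at those steps.
predicted = [
    {"cat": 0.7, "dog": 0.2, "sat": 0.1},   # step 1
    {"cat": 0.1, "dog": 0.1, "sat": 0.8},   # step 2
]

# Cross-entropy = mean of -log P(correct token) over the sequence.
loss = -sum(math.log(p[t]) for p, t in zip(predicted, targets)) / len(targets)
print(round(loss, 4))  # 0.2899
```

Training drives this quantity down, which is equivalent to raising the probability the model assigns to the observed sequences under the factorization above.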

The Transformer architecture addressed these challenges using self-attention mechanisms, enabling parallel computation and better context modeling.

\mathrm{Attention}(Q, K, V) = \mathrm{softmax}\!\left(\frac{QK^{\top}}{\sqrt{d_k}}\right) V

Large Language Models (LLMs), such as GPT and BERT, are built on transformer architectures and trained on massive datasets. GPT uses an autoregressive approach for text generation, while BERT employs bidirectional context for language understanding. These models demonstrate capabilities like translation, summarization, reasoning, and code generation.
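GPT-style autoregressive generation can be sketched as a simple decoding loop. The stand-in `toy_model` below, with its hard-coded transition table, is purely hypothetical; in a real LLM it would be a transformer forward pass returning next-token probabilities for the prefix.

```python
def toy_model(prefix):
    """Hypothetical stand-in: next-token probabilities given the prefix."""
    table = {
        ("<s>",): {"the": 0.9, "a": 0.1},
        ("<s>", "the"): {"cat": 0.6, "dog": 0.4},
        ("<s>", "the", "cat"): {"</s>": 1.0},
    }
    return table.get(tuple(prefix), {"</s>": 1.0})

def greedy_decode(max_len=10):
    """Generate left-to-right, one token at a time (greedy decoding)."""
    tokens = ["<s>"]
    for _ in range(max_len):
        probs = toy_model(tokens)
        nxt = max(probs, key=probs.get)   # greedy: pick most likely token
        if nxt == "</s>":
            break
        tokens.append(nxt)
    return tokens[1:]

print(greedy_decode())  # ['the', 'cat']
```

Each generated token is appended to the prefix and conditions the next prediction, mirroring the P(x_t | x_{<t}) factorization; BERT, by contrast, sees the full bidirectional context at once and is used for understanding rather than open-ended generation.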

Transformers are widely applied in natural language processing, speech systems, computer vision, and multimodal learning. However, LLMs have limitations, including hallucination, bias, high computational cost, and lack of consistent reasoning. Ethical concerns such as fairness, misinformation, and privacy must also be addressed.

Future research directions include efficient transformers, retrieval-augmented generation, neuro-symbolic models, explainable AI, and domain-specific models. Overall, transformers and LLMs have revolutionized AI while presenting challenges for responsible development.