attention mechanism

April 2025
M	T	W	T	F	S	S
	1	2	3	4	5	6
7	8	9	10	11	12	13
14	15	16	17	18	19	20
21	22	23	24	25	26	27
28	29	30

How a Large Language Model (LLM) predicts the next word

charles3 months ago3 months ago012 mins

How a Large Language Model (LLM) predicts the next word, including all the mathematical operations involved at each step, with the appropriate vector and tensor manipulations.

Large Language Models

charles4 months ago035 mins

Course on Large Language Models NOTE: You’re only meant to change code marked with “# TODO:” Table of Contents Setting Up API Key Configuration Connecting to OpenAI API Exploring the API Creating Chat Completions Understanding Completion Parameters Prompt Engineering Crafting Effective Prompts Strategies and Best Practices Advanced Techniques Utilizing Embeddings Function Calling in LLMs Extras…