Blog

Paper Reading #2: XLNet Explained

Paper Reading #2: XLNet Explained

One of the most celebrated, recent advancements in language understanding is the XLNet model from Carnegie Mellon University and Google. It takes the "best-of-both-worlds" approach by combining auto-encoding and autoregressive language modeling to achieve...

read more

Why is zero_grad() Called in PyTorch?

Contents Introduction Gradients in Neural Networks Backpropagation and Gradient Descent Without zero_grad() With zero_grad() Plotting Losses Monitoring Loss Summary When training neural networks in PyTorch, calling zero_grad() is essential before backpropagating...

read more
Profile Picture
Senior Advisor, Data Science | [email protected] | + posts

Suf is a senior advisor in data science with deep expertise in Natural Language Processing, Complex Networks, and Anomaly Detection. Formerly a postdoctoral research fellow, he applied advanced physics techniques to tackle real-world, data-heavy industry challenges. Before that, he was a particle physicist at the ATLAS Experiment of the Large Hadron Collider. Now, he’s focused on bringing more fun and curiosity to the world of science and research online.