2023 | 6.S898 Deep Learning Blogs 2023

Dec 19, 2023	Investigating Vision Transformer-Based Models for Closure Modeling of Fluid Dynamical Systems
Dec 12, 2023	Are Watermarked Large Language Models More Prone to Hallucinations?
Dec 12, 2023	Predicting the Future: LSTM vs Transformers for Time Series Modeling
Dec 12, 2023	Studying the benefits and limitations of sparse auto-encoders for compositional reasoning tasks
Dec 12, 2023	Solvent Encoding for solubility prediction using GNN
Dec 12, 2023	6.S898 Final Project - Investigating the biological underpinnings of latent embeddings for scRNA-seq
Dec 12, 2023	Forbidden Facts
Dec 12, 2023	Modeling Elephantfish Communication through Deep RNNs
Dec 12, 2023	Exploring Image-Supervised Contrastive Diffusion - A Comparative Analysis with Applications in Image-to-Video Generation
Dec 12, 2023	Combining Modalities for Better Molecular Representation Learning
Dec 12, 2023	Exploring Frobenius and Spectral Normalization in MLPs and Residual networks
Dec 12, 2023	Iterated Representation Learning
Dec 12, 2023	A Method for Alleviating Catastrophic Forgetting With Explainability
Dec 12, 2023	Graph Articulated Objects
Dec 12, 2023	Physics Loss
Dec 12, 2023	Diffusion Models on Low-Brightness Images
Dec 12, 2023	Semi-Supervised Domain Adaptation using Diffusion Models
Dec 12, 2023	The Effect of Activation Functions On Superposition in Toy Models
Dec 12, 2023	Stable Diffusion for Oracle Bone Script
Dec 12, 2023	Gradient-Boosted Neural Wavelet Interpolation for Time Series (G-BiTS)
Dec 12, 2023	Challenges in Deep Learning Surrogates for Constrained Linear Optimization
Dec 12, 2023	Activation Patching in Vision Transformers
Dec 12, 2023	Transformer-Based Approaches for Hyperspectral Imagery in Remote Sensing
Dec 12, 2023	Learning Generals.io
Dec 12, 2023	A Comparative Study of transformer on long sequence time series data
Dec 12, 2023	Transfer Resistant Model Training
Dec 12, 2023	Sparse Autoencoders for a More Interpretable RLHF
Dec 12, 2023	Using Synthetic Data to Minimize Real Data Requirements
Dec 12, 2023	Applications of Deep Learning in Timbre Transfer
Dec 12, 2023	The Effect of Activation Functions On Superposition in Toy Models
Dec 12, 2023	Training Robust Networks
Dec 12, 2023	Imposing uniformity through Poisson flow models
Dec 12, 2023	6-DOF estimation through visual place recognition
Dec 12, 2023	Tracing the Seeds of Conflict: Advanced Semantic Parsing Techniques for Causality Detection in News Texts
Dec 12, 2023	To Encode or Not To Encode: The Case for the Encoder-free Autodecoder Architecture
Dec 12, 2023	New Synthesis Approach for Personalized LLMs
Dec 12, 2023	Augmenting Expert Domain Image Inputs for Enhancing Visual Language Models Performance
Dec 12, 2023	Embeddings for Spatio-temporal Forecasting
Dec 12, 2023	In the pursuit of cheap and robust word embeddings
Dec 12, 2023	Leveraging Representation Engineering For LLM’s In-Context-Learning
Dec 12, 2023	Reasoning with Maps: Assessing Spatial Comprehension on Maps in Pre-trained Models
Dec 12, 2023	Autoen-chorder: Predicting Musical Success With Neural Nets
Dec 12, 2023	Ensemble Learning for Mitigating Double Descent
Dec 12, 2023	Injecting Node Information via Embedding Initializations
Dec 11, 2023	Overparameterization of Neural Networks through Kernel Regression and Gaussian Processes
Dec 11, 2023	Exploring Methods for Generating Music
Dec 11, 2023	Can Contrastive Learning Recommend Me a Movie?
Dec 11, 2023	Improving CLIP Spatial Awareness Using Hard Negative Mining
Dec 11, 2023	Multimodal Commonsense
Dec 11, 2023	Exploring Univariate Time Series Anomaly Detection using VAEs
Dec 11, 2023	Graph Transformers
Dec 11, 2023	Learning a Lifted Linearization for Switched Dynamical Systems
Dec 10, 2023	Sparse Autoencoder Universality - Under What Conditions are Learned Features Consistent?
Dec 10, 2023	Optimizations of Transformers for Small-scale Performance
Dec 10, 2023	Guided Transfer Learning and Learning How to Learn: When Is It Useful?
Dec 9, 2023	Alive Scene
Dec 5, 2023	Projected fast feedforward networks
Dec 1, 2023	Understanding Linear Mode Connectivity
Dec 1, 2023	Transformers vs. RNNs: How do findings from real-world datasets relate to the theory?
Dec 1, 2023	Exploring the latent space of text-to-image diffusion models
Nov 16, 2023	Accelerating large model inference with speculative decoding - 6.s898
Nov 11, 2023	Unraveling Social Reasoning in LLMs: A Deep Dive into the Social IQA Benchmark
Nov 11, 2023	Comparing data augmentation using VAEs and denoising-VAEs for limited noisy datasets
Nov 10, 2023	Emoji3Vec
Nov 10, 2023	Modeling Human Speech Recognition with Different Network Architectures
Nov 9, 2023	Analytic, Empirical, and Monte Carlo Bayesian Methods for Uncertainty Estimation
Nov 9, 2023	Understanding LLM Attention on Useless Numbers in Word Problems (and this Title has 8 Es)
Nov 9, 2023	Cross-Lingual Fine-Tuning for Multilingual Text Embeddings
Nov 9, 2023	Learning Interpretable Features with Sparse Auto-Encoders
Nov 9, 2023	How does model size impact catastrophic forgetting in online continual learning?
Nov 9, 2023	VGAE Clustering of the Fruit Fly Connectome
Nov 9, 2023	Robust Image to Video Generation Using Contrastive Diffusion Over Latents
Nov 9, 2023	Adaptive Controller with Neural Net Equations of Motion for High-DOF Robots
Nov 9, 2023	Robustness of self-supervised ViT features in b-mode images
Nov 9, 2023	Investigating the Impact of Symmetric Optimization Algorithms on Learnability
Nov 9, 2023	Can CNN learn shapes?
Nov 8, 2023	Quantum Circuit Optimization with Graph Neural Nets
Nov 8, 2023	Structural vs Data Inductive Bias
Nov 8, 2023	From Scroll to Misbelief - Modeling the Unobservable Susceptibility to Misinformation on Social Media
Nov 8, 2023	Examining assumptions in scRNA-seq foundation model pre-training (6.S898 Final Project)
Nov 8, 2023	Increasing Context Length For Transformers
Nov 8, 2023	Zero-Shot Machine-Generated Image Detection using Sinks of Gradient Flows
Nov 8, 2023	Denoising EMG signals
Nov 8, 2023	A Deeper Look into Equivariance for Materials Data
Nov 7, 2023	Prompt to Prompt
Nov 7, 2023	Understanding Bias in Speech to Text Language Models
Nov 6, 2023	Regularization Techniques for Attention Layers in Transformer Models
Nov 5, 2023	Neural PDEs for learning local dynamics and longer temporal rollouts
Nov 1, 2023	Graph neural networks vs. transformers for geometric graphs