About deep2Read

    Dr. Yanjun Qi

    Here are papers I have reviewed. I am science-curious.


    deep2reproduce 2019 Fall - 1Analysis papers

    | Team INDEX | Title & Link | Tags | Our Slide |
    |---|---|---|---|
    | T2 | Empirical Study of Example Forgetting During Deep Neural Network Learning | Sample Selection, forgetting | OurSlide |
    | T29 | Select Via Proxy: Efficient Data Selection For Training Deep Networks | Sample Selection | OurSlide |
    | T9 | How SGD Selects the Global Minima in Over-parameterized Learning | optimization | OurSlide |
    | T10 | Escaping Saddles with Stochastic Gradients | optimization | OurSlide |
    | T13 | To What Extent Do Different Neural Networks Learn the Same Representation | subspace | OurSlide |
    | T19 | On the Information Bottleneck Theory of Deep Learning | informax | OurSlide |
    | T20 | Visualizing the Loss Landscape of Neural Nets | normalization | OurSlide |
    | T21 | Using Pre-Training Can Improve Model Robustness and Uncertainty | training, analysis | OurSlide |
    | T24 | Norm matters: efficient and accurate normalization schemes in deep networks | normalization | OurSlide |

    Tags: analysis, forgetting, generalization, informax, normalization, optimization, Sample-selection, subspace, training

    Categories: 1Theoretical

    Updated: December 12, 2019


