MLSys/FM

In top venues

Complete list

Refer to Google Scholar and DBLP for more updated information.





Filters
Categories
Search
  • DeepZoning: Re-accelerate CNN Inference with Zoning Graph for Heterogeneous Edge Cluster

    DeepZoning: Re-accelerate CNN Inference with Zoning Graph for Heterogeneous Edge Cluster

    Jingyu Wang, Ruilong Ma, Xiang Yang, Qi Qi, Zirui Zhuang, Jing Wang, Jianxin Liao, and Song Guo

    [

    TACO

    ]

  • FM-Delta: Lossless Compression for Storing Massive Fine-tuned Foundation Models

    FM-Delta: Lossless Compression for Storing Massive Fine-tuned Foundation Models

    Wanyi Ning, Jingyu Wang, Qi Qi, Mengde Zhu, Haifeng Sun, Daixuan Cheng, Jianxin Liao, Ce Zhang

    [

    NeurIPS

    ]

  • PICO: Pipeline Inference Framework for Versatile CNNs on Diverse Mobile Devices

    PICO: Pipeline Inference Framework for Versatile CNNs on Diverse Mobile Devices

    Xiang Yang, Zikang Xu, Qi Qi, Jingyu Wang, Haifeng Sun, Jianxin Liao, Song Guo

    [

    TMC

    ]

  • PipeLLM: Pipeline LLM Inference on Heterogeneous Devices with Sequence Slicing

    PipeLLM: Pipeline LLM Inference on Heterogeneous Devices with Sequence Slicing

    Ruilong Ma, Jingyu Wang, Qi Qi, Xiang Yang, Haifeng Sun, Zirui Zhuang, Jianxin Liao

    [

    SIGCOMM

    ]

  • Following the Correct Direction: Renovating Sparsified SGD towards Global Optimization in Distributed Edge Learning

    Following the Correct Direction: Renovating Sparsified SGD towards Global Optimization in Distributed Edge Learning

    Wanyi Ning, Haifeng Sun, Xiaoyuan Fu, Xiang Yang, Qi Qi, Jingyu Wang, Jianxin Liao, Zhu Han

    [

    JSAC

    ]

  • OICSR: Out-In-Channel Sparsity Regularization for Compact Deep Neural Networks

    OICSR: Out-In-Channel Sparsity Regularization for Compact Deep Neural Networks

    Jiashi Li, Qi Qi, Jingyu Wang, Ce Ge, Zhangzhang Yue, Haifeng Sun

    [

    CVPR

    ]