Papers - IoT
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases
Paper
• 2402.14905
• Published
• 134
Sensor-based Multi-Robot Search and Coverage with Spatial Separation in Unstructured Environments
Paper
• 2403.01710
• Published
• 2
EdgeMoE: Fast On-Device Inference of MoE-based Large Language Models
Paper
• 2308.14352
• Published
Slimmable Encoders for Flexible Split DNNs in Bandwidth and Resource Constrained IoT Systems
Paper
• 2306.12691
• Published
• 3
Bias Loss for Mobile Neural Networks
Paper
• 2107.11170
• Published
• 2
MicroNAS: Memory and Latency Constrained Hardware-Aware Neural Architecture Search for Time Series Classification on Microcontrollers
Paper
• 2310.18384
• Published
• 2
Pattern Discovery in Time Series with Byte Pair Encoding
Paper
• 2106.00614
• Published
• 2
Towards a World-English Language Model for On-Device Virtual Assistants
Paper
• 2403.18783
• Published
• 6
Transformer-Lite: High-efficiency Deployment of Large Language Models on Mobile Phone GPUs
Paper
• 2403.20041
• Published
• 34
Octopus v2: On-device language model for super agent
Paper
• 2404.01744
• Published
• 58
LLM in a flash: Efficient Large Language Model Inference with Limited Memory
Paper
• 2312.11514
• Published
• 260
Octopus v4: Graph of language models
Paper
• 2404.19296
• Published
• 118