Sterling, Virginia, United States
Neuroscale AI operates as an affiliated entity of Intellectual Point, collaborating to advance AI-driven decision-making solutions.
AI Agents for Decision Making: DeepSpeed, PyTorch, vLLM, LangChain, PGVector, RAG, Multimodal
• Developed Arbi, an AI copilot using AI Agents to evaluate RFPs, RFIs, resumes, and customer call transcripts.
• Extracted data from PDFs and images using Qwen-2.5-VL, handling complex structures and interpreting diagrams.
• Fine-tuned QwQ/Qwen-32B with PyTorch, DeepSpeed for memory-efficient distributed multi-GPU training,
achieving 10x speedup, and LoRA for PEFT, enhancing reasoning capabilities via knowledge distillation.
• Applied Quantization Aware Training to models, reducing size and compute by 4 times with minimal loss.
• Leveraged vLLM for optimized inference, continuous batching, and low-latency high-throughput deployment.