<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title><![CDATA[50 FDE Decomposition and Case Study Problems with Solutions]]></title><description><![CDATA[<h1>50 FDE Interview Problems: Decomposition and Case Studies</h1>
<p dir="auto">The decomposition interview is the signature FDE interview format. You're given a vague business problem and must break it into technical components, propose an architecture, and discuss trade-offs — all while communicating clearly.</p>
<p dir="auto">These problems are organized by industry and difficulty. Each includes a problem statement, key questions to ask, and a suggested approach.</p>
<hr />
<h2>How to Approach Decomposition Problems</h2>
<h3>The Framework (5 steps, 45 minutes)</h3>
<ol>
<li><strong>Clarify</strong> (5 min) — Ask questions. Understand the customer, constraints, and success metrics.</li>
<li><strong>Decompose</strong> (10 min) — Break the problem into 3-5 sub-problems.</li>
<li><strong>Prioritize</strong> (5 min) — Which sub-problem delivers the most value first?</li>
<li><strong>Design</strong> (15 min) — Architecture the solution. Draw diagrams. Discuss data flow.</li>
<li><strong>Trade-offs</strong> (10 min) — What could go wrong? What would you do differently with more time?</li>
</ol>
<hr />
<h2>Logistics &amp; Supply Chain (Problems 1-10)</h2>
<h3>Problem 1: Package Routing Optimization</h3>
<p dir="auto"><strong>Scenario:</strong> A shipping company delivers 500K packages daily across 200 cities. They want to reduce delivery times by 15%.</p>
<p dir="auto"><strong>Key questions to ask:</strong></p>
<ul>
<li>What data do they currently collect? (GPS, timestamps, weather, traffic)</li>
<li>What's their current routing system? (Manual? Basic algorithm?)</li>
<li>What does "delivery time" mean? (Warehouse to door? Last mile only?)</li>
</ul>
<p dir="auto"><strong>Approach:</strong></p>
<ul>
<li>Sub-problems: (1) Data ingestion from GPS/IoT, (2) Route optimization algorithm, (3) Real-time re-routing, (4) Performance monitoring dashboard</li>
<li>Start with: Historical data analysis to identify bottleneck routes</li>
<li>Tech: Graph optimization, real-time streaming (Kafka), geospatial queries (PostGIS)</li>
</ul>
<h3>Problem 2: Warehouse Inventory Prediction</h3>
<p dir="auto"><strong>Scenario:</strong> An e-commerce company has 15 warehouses. They over-stock 30% of items and under-stock 20%. Design a system to predict optimal inventory levels.</p>
<h3>Problem 3: Fleet Maintenance Scheduling</h3>
<p dir="auto"><strong>Scenario:</strong> A trucking company has 3,000 vehicles. They want to predict maintenance needs to reduce breakdowns by 50%.</p>
<h3>Problem 4: Port Container Tracking</h3>
<p dir="auto"><strong>Scenario:</strong> A shipping port processes 10,000 containers daily. Containers get lost or delayed. Build a real-time tracking system.</p>
<h3>Problem 5: Last-Mile Delivery Optimization</h3>
<p dir="auto"><strong>Scenario:</strong> A grocery delivery service operates in 5 cities. Each driver makes 20-30 deliveries per shift. Optimize driver assignment and routing.</p>
<hr />
<h2>Healthcare (Problems 6-15)</h2>
<h3>Problem 6: Patient Readmission Prediction</h3>
<p dir="auto"><strong>Scenario:</strong> A hospital network wants to reduce 30-day readmission rates. They have 5 years of patient records across 12 hospitals.</p>
<p dir="auto"><strong>Key questions to ask:</strong></p>
<ul>
<li>What data is available? (EHR, labs, medications, demographics)</li>
<li>HIPAA constraints? Data residency requirements?</li>
<li>What interventions are possible if we predict high risk?</li>
</ul>
<p dir="auto"><strong>Approach:</strong></p>
<ul>
<li>Sub-problems: (1) Data pipeline from EHR systems (HL7/FHIR), (2) Feature engineering, (3) ML model (XGBoost or similar), (4) Clinical dashboard for care teams, (5) Feedback loop for model improvement</li>
<li>Start with: Retrospective analysis on historical readmissions</li>
<li>Critical: Model explainability — clinicians need to understand WHY a patient is flagged</li>
</ul>
<h3>Problem 7: Medical Image Triage</h3>
<p dir="auto"><strong>Scenario:</strong> A radiology department processes 500 scans daily. They want AI to prioritize urgent cases.</p>
<h3>Problem 8: Drug Interaction Alert System</h3>
<p dir="auto"><strong>Scenario:</strong> A pharmacy chain wants real-time alerts when prescriptions have dangerous interactions.</p>
<h3>Problem 9: Clinical Trial Patient Matching</h3>
<p dir="auto"><strong>Scenario:</strong> A pharma company has 50 active clinical trials. They need to match eligible patients faster.</p>
<h3>Problem 10: Hospital Bed Capacity Planning</h3>
<p dir="auto"><strong>Scenario:</strong> A 500-bed hospital frequently runs at 95%+ capacity. Design a prediction system for bed availability.</p>
<hr />
<h2>Financial Services (Problems 11-20)</h2>
<h3>Problem 11: Fraud Detection Pipeline</h3>
<p dir="auto"><strong>Scenario:</strong> A fintech processes 10M transactions daily. Current fraud detection catches 60% of fraudulent transactions with a 5% false positive rate. Improve both metrics.</p>
<p dir="auto"><strong>Key questions to ask:</strong></p>
<ul>
<li>Latency requirements? (Real-time blocking vs. post-transaction review?)</li>
<li>What data is available? (Transaction details, device info, user behavior, merchant data)</li>
<li>What happens when fraud is detected? (Block, flag, require verification?)</li>
</ul>
<p dir="auto"><strong>Approach:</strong></p>
<ul>
<li>Sub-problems: (1) Real-time feature computation, (2) ML scoring pipeline, (3) Rules engine for known patterns, (4) Investigation dashboard, (5) Feedback loop from investigators</li>
<li>Architecture: Streaming pipeline (Kafka → feature store → model serving → decision engine)</li>
<li>Key trade-off: Latency vs. accuracy. Adding more features improves detection but increases latency.</li>
</ul>
<h3>Problem 12: Know Your Customer (KYC) Automation</h3>
<p dir="auto"><strong>Scenario:</strong> A bank spends 45 minutes on average per KYC check. They want to automate 80% of checks.</p>
<h3>Problem 13: Portfolio Risk Dashboard</h3>
<p dir="auto"><strong>Scenario:</strong> An investment firm manages $5B across 200 portfolios. Build a real-time risk monitoring system.</p>
<h3>Problem 14: Loan Default Prediction</h3>
<p dir="auto"><strong>Scenario:</strong> A lending platform wants to predict loan defaults at application time.</p>
<h3>Problem 15: Anti-Money Laundering (AML) Graph Analysis</h3>
<p dir="auto"><strong>Scenario:</strong> A bank needs to detect suspicious transaction networks across 50M accounts.</p>
<hr />
<h2>Defense &amp; Government (Problems 16-25)</h2>
<h3>Problem 16: Satellite Imagery Analysis</h3>
<p dir="auto"><strong>Scenario:</strong> A defense agency receives 10TB of satellite imagery daily. They need to detect changes (new construction, vehicle movement) automatically.</p>
<h3>Problem 17: Cybersecurity Threat Intelligence</h3>
<p dir="auto"><strong>Scenario:</strong> A government network operations center monitors 500K endpoints. They want to reduce mean time to detect threats from 72 hours to 4 hours.</p>
<h3>Problem 18: Disaster Response Resource Allocation</h3>
<p dir="auto"><strong>Scenario:</strong> After a natural disaster, coordinate rescue teams, supplies, and medical resources across an affected region.</p>
<h3>Problem 19: Border Surveillance System</h3>
<p dir="auto"><strong>Scenario:</strong> Monitor 500 miles of border using a combination of sensors, cameras, and drones.</p>
<h3>Problem 20: Supply Chain Security for Critical Infrastructure</h3>
<p dir="auto"><strong>Scenario:</strong> A government agency needs to verify that hardware components haven't been tampered with across a global supply chain.</p>
<hr />
<h2>Retail &amp; E-Commerce (Problems 21-30)</h2>
<h3>Problem 21: Real-Time Pricing Engine</h3>
<p dir="auto"><strong>Scenario:</strong> An e-commerce platform with 5M products wants dynamic pricing based on demand, competition, and inventory.</p>
<h3>Problem 22: Customer Segmentation at Scale</h3>
<p dir="auto"><strong>Scenario:</strong> A retailer with 20M customers wants to create dynamic segments for personalized marketing.</p>
<h3>Problem 23: Store Layout Optimization</h3>
<p dir="auto"><strong>Scenario:</strong> A grocery chain wants to use purchase data and foot traffic to optimize product placement.</p>
<h3>Problem 24: Returns Prediction and Prevention</h3>
<p dir="auto"><strong>Scenario:</strong> An online fashion retailer has a 35% return rate. Predict and reduce returns.</p>
<h3>Problem 25: Omnichannel Inventory Visibility</h3>
<p dir="auto"><strong>Scenario:</strong> A retailer with 500 stores and an e-commerce site wants unified, real-time inventory visibility.</p>
<hr />
<h2>Energy &amp; Manufacturing (Problems 26-35)</h2>
<h3>Problem 26: Predictive Maintenance for Wind Turbines</h3>
<p dir="auto"><strong>Scenario:</strong> A wind farm operator has 200 turbines with IoT sensors. Predict failures before they cause downtime.</p>
<h3>Problem 27: Energy Grid Load Balancing</h3>
<p dir="auto"><strong>Scenario:</strong> A utility company needs to balance supply and demand across a grid with 30% renewable (intermittent) energy.</p>
<h3>Problem 28: Quality Control with Computer Vision</h3>
<p dir="auto"><strong>Scenario:</strong> A manufacturing line produces 50,000 units daily. Detect defects using camera inspection.</p>
<h3>Problem 29: Digital Twin for Factory Operations</h3>
<p dir="auto"><strong>Scenario:</strong> Build a digital twin of a factory floor to simulate and optimize production workflows.</p>
<h3>Problem 30: Carbon Emissions Tracking</h3>
<p dir="auto"><strong>Scenario:</strong> A large corporation needs to track Scope 1, 2, and 3 carbon emissions across their operations and supply chain.</p>
<hr />
<h2>AI / ML Specific (Problems 31-40)</h2>
<h3>Problem 31: Enterprise RAG System</h3>
<p dir="auto"><strong>Scenario:</strong> A law firm has 10M legal documents. Lawyers need to query them using natural language and get accurate, cited answers.</p>
<p dir="auto"><strong>Key questions to ask:</strong></p>
<ul>
<li>Document types? (PDFs, emails, contracts, case law)</li>
<li>Accuracy requirements? (Legal context = very high)</li>
<li>Latency? (Interactive search vs. batch analysis)</li>
</ul>
<p dir="auto"><strong>Approach:</strong></p>
<ul>
<li>Sub-problems: (1) Document ingestion and chunking, (2) Embedding generation, (3) Vector store with metadata filtering, (4) Retrieval pipeline with re-ranking, (5) LLM generation with citations, (6) Evaluation and feedback</li>
<li>Key trade-offs: Chunk size vs. context preservation. Speed vs. accuracy. Cost of LLM calls.</li>
<li>Start with: 1,000 documents, one practice area, measure retrieval quality before scaling.</li>
</ul>
<h3>Problem 32: AI Agent for Customer Support</h3>
<p dir="auto"><strong>Scenario:</strong> A SaaS company handles 50K support tickets monthly. Build an AI agent that resolves 60% automatically.</p>
<h3>Problem 33: LLM-Powered Data Analyst</h3>
<p dir="auto"><strong>Scenario:</strong> A business intelligence team wants non-technical users to query data using natural language.</p>
<h3>Problem 34: Content Moderation at Scale</h3>
<p dir="auto"><strong>Scenario:</strong> A social platform needs to moderate 1M posts daily for harmful content.</p>
<h3>Problem 35: Multi-Modal Search Engine</h3>
<p dir="auto"><strong>Scenario:</strong> A media company has 5M images, videos, and documents. Build a search system that accepts text, image, or audio queries.</p>
<hr />
<h2>Cross-Functional (Problems 36-50)</h2>
<h3>Problem 36: Data Migration Strategy</h3>
<p dir="auto"><strong>Scenario:</strong> Migrate a Fortune 500's data from Oracle + Hadoop to a modern cloud platform with zero downtime.</p>
<h3>Problem 37: Real-Time Recommendation System</h3>
<p dir="auto"><strong>Scenario:</strong> A streaming service wants personalized recommendations updated in real-time as users browse.</p>
<h3>Problem 38: IoT Data Platform</h3>
<p dir="auto"><strong>Scenario:</strong> A smart building company has 100K sensors across 500 buildings. Build a platform for real-time monitoring and analytics.</p>
<h3>Problem 39: Compliance Monitoring System</h3>
<p dir="auto"><strong>Scenario:</strong> A financial institution needs automated monitoring of 500+ regulatory requirements.</p>
<h3>Problem 40: Multi-Tenant SaaS Customization</h3>
<p dir="auto"><strong>Scenario:</strong> Your product serves 200 enterprise customers, each wanting custom workflows. Design a customization layer.</p>
<h3>Problem 41: Event-Driven Architecture Migration</h3>
<p dir="auto"><strong>Scenario:</strong> Migrate a monolithic batch-processing system to event-driven real-time architecture.</p>
<h3>Problem 42: Data Quality Monitoring</h3>
<p dir="auto"><strong>Scenario:</strong> A data platform has 10,000 tables. Build automated data quality checks with alerting.</p>
<h3>Problem 43: API Gateway Design</h3>
<p dir="auto"><strong>Scenario:</strong> A company has 50 microservices. Design an API gateway for external partner access with rate limiting, auth, and versioning.</p>
<h3>Problem 44: Search Infrastructure</h3>
<p dir="auto"><strong>Scenario:</strong> An e-commerce site with 10M products needs search that handles typos, synonyms, and personalized ranking.</p>
<h3>Problem 45: Real-Time Dashboard for Operations</h3>
<p dir="auto"><strong>Scenario:</strong> An operations team needs a dashboard showing real-time metrics from 20 different data sources with &lt;5 second latency.</p>
<h3>Problem 46: Customer Data Platform</h3>
<p dir="auto"><strong>Scenario:</strong> Unify customer data from CRM, website, mobile app, support tickets, and purchase history into a single customer view.</p>
<h3>Problem 47: Feature Store Design</h3>
<p dir="auto"><strong>Scenario:</strong> An ML team has 50 models in production. They're duplicating feature computation. Design a shared feature store.</p>
<h3>Problem 48: Data Marketplace</h3>
<p dir="auto"><strong>Scenario:</strong> A data company wants to let customers discover, preview, and subscribe to datasets through a self-service marketplace.</p>
<h3>Problem 49: Workflow Automation Platform</h3>
<p dir="auto"><strong>Scenario:</strong> A consulting firm has 200 consultants doing repetitive data processing tasks. Build a no-code/low-code automation platform.</p>
<h3>Problem 50: AI-Powered Document Processing</h3>
<p dir="auto"><strong>Scenario:</strong> An insurance company processes 100K claims documents monthly. 80% are still manually reviewed. Automate extraction and classification.</p>
<hr />
<h2>Practice Tips</h2>
<ol>
<li><strong>Time yourself.</strong> 45 minutes per problem. If you can't structure an approach in 5 minutes, your framework needs work.</li>
<li><strong>Draw diagrams.</strong> Interviewers want to see visual thinking. Practice on a whiteboard or drawing tool.</li>
<li><strong>Talk through trade-offs.</strong> There's no single right answer. Show that you understand the implications of your choices.</li>
<li><strong>Ask questions first.</strong> The best FDE candidates spend 20% of the time clarifying the problem.</li>
<li><strong>Start with the simplest version.</strong> Deploy a POC in week 1, iterate based on feedback.</li>
</ol>
<hr />
<p dir="auto"><em>Want to discuss specific solutions? Pick a problem number and post your approach in the replies. Community feedback is the best interview prep.</em></p>
]]></description><link>https://fde.today/topic/40/50-fde-decomposition-and-case-study-problems-with-solutions</link><generator>RSS for Node</generator><lastBuildDate>Sat, 25 Apr 2026 05:55:31 GMT</lastBuildDate><atom:link href="https://fde.today/topic/40.rss" rel="self" type="application/rss+xml"/><pubDate>Sun, 01 Mar 2026 02:09:22 GMT</pubDate><ttl>60</ttl></channel></rss>