Professional Documents
Culture Documents
Building A Better Data Warehouse For More Data, More Speed, And More Users
Raymond Gardiner Goss
raymond.goss@globalfoundries.com
Kousikan Veeramuthu
kousikan.veeramuthu@globalfoundries.com
Manufacturing Technology
GLOBALFOUNDRIES
Malta, NY, USA
determine the callers income level and specify to which agent
to route the call or the switch would timeout. When switches
were overwhelmed with data, they would drop packets and
algorithms had to infer states based on most probable current
state. Other industries, such as social media, are challenged
more by unstructured data and need tools to help turn text
messages and photos into useful information for search engines
and marketing purposes. The challenge in the semiconductor
world is with the size of the data. Speed becomes a secondary
problem because so many sources are needed to be joined
together in a timely manner. Large recipes, complex output
from the test floor combined now with more Interface-A trace
data amass terabytes each month that need to be handled for
both real-time SPC, APC, and command and control scenarios
as well as offline yield analyses. Users now require real-time
access to data from a much larger pool of sources. This paper
describes the various states of handling the increasing
complexity and volumes today and the challenges ahead.
I.
Real-Time,
II.
Analysis,
INTRODUCTION
Not long ago, the price of gasoline was less expensive and
we drove cars based on features we desired like roof racks,
cargo space, sporty looks, and prestige, but the world is
changing. The cost of fuel has increased and we are more
aware of the environmental concerns. We are switching to
vehicles with different engines that go much further with less
energy, but still expect all the new features of built-in GPS,
backup cameras and keyless ignition. The move to Big Data
will be a similar paradigm shift. The principle analysis is the
same, but the engines and amount of data are changing.
B. Complexity Example
Big Data covers a range of situations, all with the common
theme of more more variety, more quantity, more users,
more speed, more complexity. There are currently different
Big Data solution approaches to each of these. Let us start off
with an example of determining root cause and correlation of a
220
ASMC 2013
Ad Hoc
Ad Hoc
Ad Hoc App
Factory Systems
UI
framew ork
SPC
FDC
Setup
Decison
Serv ices &
Integration
Replication
MQ Message Bus
EI
Dispatch &
Scheduling
APC
Other Fabs/Corporate
Systems
CMMS
RM S
eTEST
Data
Warehouse
(GEM-D)
Business Analysis
and Reporting
221
ASMC 2013
C. Automated Algorithms
Big Data can feed advanced analytics and algorithms to
vastly improve the decision making process and identify
valuable insights which were previously hidden or not easily
available. Fabs can adjust production lines automatically to
optimize efficiency, reduce waste, and avoid dangerous
conditions. At GLOBALFOUNDRIES, we are already using
controlled experiments to make better decisions by embedding
real-time, highly granular data from networked sensors in the
supply chain and production processes. Automating the
analysis of the data reported by sensors embedded in complex
products combined with the tool owners input enables
manufacturers to create proactive smart preventive
maintenance service. Service personnel can perform the PM
operations before there is an equipment failure which may
cause costly fab disruptions. Also this enables the fab to have
a new business model like leasing the portions of fab space for
specific customers based on the sensor data.
D. Virtual Factory.
Taking product development and historical data and real-time
inputs from MES data, fabs can apply advanced computational
methods to create a digital model of the entire manufacturing
process. This model can be used to design and simulate the
most efficient production system. Some of the applications of
virtual factory include: 1. validation of designed production
concept; 2. processes verification before start of production; 3.
optimization of production equipment allocation; 4. bottlenecks
and collisions analysis; 5. better utilization of existing
resources; 6. eliminating errors in the production line. Fab
engineers can leverage the power of Big Data to simulate these
operations with millions of different combinations to optimally
schedule and dispatch WIP.
IV.
B. Experimental Analysis
Experimental analysis is critical in many areas of our fabs.
Ability to process huge amounts of data will open up the
possibility of conducting new tests and analysis which were
unimaginable earlier. This will have a dramatic effect in areas
like R&D and will help achieve faster time to market.
222
ASMC 2013
Current State
GEM-D Staging
Layer
Integrated Identity
Mgmt
Evaluating compression,
In-memory analysis and
hardware solutions
GEM-D Logical
layer
New BI
Tools &
Reports
New
Analysis
Tools
223
ASMC 2013
VI.
V.
The path for the next level of data management is not yet
defined. Reporting and analysis has already moved beyond the
single system or data set. The factory data volumes are
expanding rapidly. The reader can see from Fig. 3 that there
are significant tradeoffs with using these solutions. The
appliance covers a subset of the traditional RDBMS whereas
the Hadoop paradigm covers new territory.
ACKNOWLEDGMENT
The authors would like to thank the Data Integration and IT
DBA teams at GLOBALFOUNDRIES in deploying GEM-D
and making the current solutions a reality.
CONCLUSION
224
ASMC 2013
Pros
Appliance
RDBMS
Hadoop
In Memory DB
SSD
Cons
[5]
[1]
[2]
[3]
[4]
http://blogs.gartner.com/merv-adrian/2013/02/10/hadoop-and-di-aplatform-is-not-a-solution/
[6] http://www.saphana.com/community/blogs/blog/2012/04/30/whatoracle-wont-tell-you-about-sap-hana
[7] http://www.dbms2.com/2012/08/20/in-memory-hybrid-memorycentric/
[8] http://hadoop.apache.org/
[9] http://en.wikipedia.org/wiki/MapReduce
[10] http://en.wikipedia.org/wiki/Big_data
[11] http://www.datanami.com/datanami/2012-0213/big_data_and_the_ssd_mystique.html
REFERENCES
J Manyika, M Chui, B Brown, J Bughin, R Dobbs, C Roxburgh, A H
Byers, McKinsey Global Institute Publication "Big data: The next
frontier for innovation, competition, and productivity"
J G Kobielus, The Forrester Wave: Enterprise Hadoop Solutions,
Q1 2012
A McClean, R. C. Conceio, M OHalloran, "A Comparison of
MapReduce and Parallel Database Management Systems," ICONS
2013, The Eighth International Conference on Systems.
"Big Data Now: 2012 Edition" OReilly Media, Inc.
225
ASMC 2013