Patent Application Titled "Workload-Aware Distributed Data Processing Apparatus and Method for Processing Large Data Based on Hardware Acceleration" Published Online
No assignee for this patent application, patent application serial number 595088, has been made.
Reporters obtained the following quote from the background information supplied by the inventors: "The following description relates to a workload-aware distributed data processing apparatus and method for processing large data based on hardware acceleration.
"With the increasing development of Internet technology, a larger amount of data has been created and distributed over the Internet. In such an environment where a large amount of data is available for use, a variety of companies, for example, portal companies, raise their competitiveness by accumulating a massive amount of data, and extracting and providing significant information to a user on request. Accordingly, various research has been conducted on establishing a large-scale cluster at a reduced cost to enable distributed data processing and distributed data parallel processing.
"A distributed parallel processing programming model supports distributed parallel computation of a large amount of data stored in a cluster that is formed of a large number of nodes at a low cost. The distributed parallel processing programming model includes two steps: a 'Map step' based on a map function made by a user and a 'Reduce step' based on a reduce function. These two steps are performed in turn. However, as the amount of data to be processed increases, reduction of data analysis time to improve performance is more difficult."
In addition to obtaining background information on this patent application, VerticalNews editors also obtained the inventors' summary information for this patent application: "In one general aspect, there is provided a data processing apparatus including a memory buffer including partitions. The data processing apparatus further includes a partition unit configured to distribute a mapping result to the partitions based on a partition proportion scheme. The data processing apparatus further includes a reduce node configured to receive content of a corresponding one of the partitions, and perform a reduction operation on the content to generate a reduce result.
"The data processing apparatus may further include a job controller configured to measure a processing performance of the reduce node, and establish the partition proportion scheme based on the processing performance.
"The mapping result may include a key. The partition unit may include a main processor configured to generate an intermediate code with a fixed size based on the key. The partition unit may further include a proportion controller configured to determine the partitions based on the intermediate code and the partition proportion scheme, generate partition codes corresponding to the respective partitions.
Most Popular Stories
- AIG to Create 230 Jobs in Charlotte
- 15 Myths That Could Ruin Your Hispanic Ad Campaign
- Russia Says Nyet to Canada North Pole Claim
- Bipartisan Negotiators Reach Modest Budget Agreement
- Justin Bieber Visits Typhoon Victims, Plays Concert
- Senate Dems Move Forward With Obama Nominees
- Obama Nominee Confirmed for D.C. Appeals Court
- New Obama Aide to Focus on Climate Change
- MasterCard to Split Shares, Raise Dividend
- GOP, Dems Strain to Unearth a Modest Budget Pact