Core Cloud Computing Papers
Published: 2019-06-25

Google 
1. nosqldbs-NOSQL Introduction and Overview 
2. system and method for data distribution(2009) 
3. System and method for large-scale data processing using an application-independent framework(2010) 
4. MapReduce: Simplified Data Processing on Large Clusters; 
5. MapReduce: A Flexible Data Processing Tool(2010)
6. Map-Reduce-Merge: Simplified Relational Data Processing on Large Clusters 
7. MapReduce and Parallel DBMSs--Friends or Foes(2010) 
8. Presentation:MapReduce and Parallel DBMSs:Together at Last (2010) 
9. Twister: A Runtime for Iterative MapReduce(2010) 
10. MapReduce Online(2009) 
11. Megastore: Providing Scalable, Highly Available Storage for Interactive Services (2011,CIDR) 
12. Interpreting the Data:Parallel Analysis with Sawzall 
13. Dapper, a Large-Scale Distributed Systems Tracing Infrastructure (technical report 2010)
14. Large-scale Incremental Processing Using Distributed Transactions and Notifications(2010) 
15. Improving MapReduce Performance in Heterogeneous Environments 
16. Dremel: Interactive Analysis of Web-Scale Datasets(2011)
17. Chukwa: a scalable cloud monitoring System (presentation)
18. The Chubby lock service for loosely-coupled distributed systems
19. Paxos Made Simple(2001,Lamport)
20. Fast Paxos(2006)
21. Paxos Made Live - An Engineering Perspective(2007)
22. Classic Paxos vs. Fast Paxos: Caveat Emptor
23. On the Coordinator’s Rule for Fast Paxos(2005)
24. Paxos made code: Implementing a high throughput Atomic Broadcast (2009)
25. Bigtable: A Distributed Storage System for Structured Data(2006)
26. The Google File System
Google patent papers 
1. Data processing system and method for financial debt instruments(1999) 
2. Data processing system and method to enforce payment of royalties when copying softcopy books(1996) 
3. Data processing systems and methods(2005) 
4. Large-scale data processing in a distributed and parallel processing environment(2010) 
5. METHODS AND SYSTEMS FOR MANAGEMENT OF DATA() 
6. SEARCH OVER STRUCTURED DATA(2011) 
7. System and method for maintaining replicated data coherency in a data processing system(1995) 
8. System and method of using data mining prediction methodology(2006) 
9. System and Methodology for Data Processing Combining Stream Processing and spreadsheet computation(2011) 
10. Patent Factor index report of system and method of using data mining prediction methodology 
11. Pregel: A System for Large-Scale Graph Processing(2010) 
Hadoop 
1. A simple totally ordered broadcast protocol 
2. ZooKeeper: Wait-free coordination for Internet-scale systems 
3. Zab: High-performance broadcast for primary-backup systems(2011) 
4. Wait-free synchronization(1991)
5. ON SELF-STABILIZING WAIT-FREE CLOCK SYNCHRONIZATION(1997) 
6. Wait-free clock synchronization(ps format) 
7. Programming with ZooKeeper - A basic tutorial 
8. Hive – A Petabyte Scale Data Warehouse Using Hadoop 
9. Thrift: Scalable Cross-Language Services Implementation(Facebook) 
10. Hive other files: HiveMetaStore class picture, Chinese docs 
11. Scaling out data preprocessing with Hive (2011) 
12. HBase The Definitive Guide - 2011 
13. Nova: Continuous Pig/Hadoop Workflows(yahoo,2011) 
14. Pig Latin: A Not-So-Foreign Language for Data Processing(2008) 
15. Analyzing Massive Astrophysical Datasets: Can Pig/Hadoop or a Relational DBMS Help?(2009) 
a. Some docs about HStreaming,Zebra 
16. HIPI: A Hadoop Image Processing Interface for Image-based MapReduce Tasks 
17. System Anomaly Detection in Distributed Systems through MapReduce-Based Log Analysis(2010) 
18. Benchmarking Cloud Serving Systems with YCSB(2010) 
19. Low-Latency, High-Throughput Access to Static Global Resources within the Hadoop Framework (2009) 
SmallFile Combine in hadoop world 
1. TidyFS: A Simple and Small Distributed File System(Microsoft) 
2. Improving the storage efficiency of small files in cloud storage(Chinese,2011)
3. Comparing Hadoop and Fat-Btree Based Access Method for Small File I/O Applications(2010) 
4. RCFile: A Fast and Space-efficient Data Placement Structure in MapReduce-based Warehouse Systems(Facebook) 
5. A Novel Approach to Improving the Efficiency of Storing and Accessing Small Files on Hadoop: a Case Study by PowerPoint Files(IBM,2010) 
Job schedule 
1. Job Scheduling for Multi-User MapReduce Clusters(Facebook) 
2. MapReduce Scheduler Using Classifiers for Heterogeneous Workloads(2011) 
3. Performance-Driven Task Co-Scheduling for MapReduce Environments 
4. Towards a Resource Aware Scheduler in Hadoop(2009) 
5. Delay Scheduling: A Simple Technique for Achieving Locality and Fairness in Cluster Scheduling(yahoo,2010)
6. Dynamic Proportional Share Scheduling in Hadoop(HP)
7. Adaptive Task Scheduling for MultiJob MapReduce Environments(2010)
8. A Dynamic MapReduce Scheduler for Heterogeneous Workloads(2009)
HStreaming 
1. HStreaming Cloud Documentation 
2. S4: Distributed Stream Computing Platform(yahoo,2010) 
3. Complex Event Processing(2009) 
4. Hstreaming : http://www.hstreaming.com/resources/manuals/ 
5. StreamBase: http://streambase.com/developers-docs-pdfindex.htm 
6. Twitter storm: http://www.infoq.com/cn/news/2011/09/twitter-storm-real-time-hadoop 
7. Bulk Synchronous Parallel(BSP) computing 
8. MPI 
SQL/Mapreduce 
1. Aster Data whitepaper: Deriving Deep Insights from Large Datasets with SQL-MapReduce (2004)
2. SQL/MapReduce: A practical approach to self-describing,polymorphic, and parallelizable user-defined functions(2009,aster) 
3. HadoopDB: An Architectural Hybrid of MapReduce and DBMS Technologies for Analytical Workloads(2009) 
4. HadoopDB in Action: Building Real World Applications(2010) 
5. Aster Data presentation: Making Advanced Analytics on Big Data Fast and Easy(2010) 
6. A Scalable, Predictable Join Operator for Highly Concurrent Data Warehouses(2009)
7. Cheetah: A High Performance, Custom Data Warehouse on Top of MapReduce(2010)
8. Greenplum whitepaper: A Unified Engine for RDBMS and MapReduce(2004)
9. A Comparison of Approaches to Large-Scale Data Analysis(2009)
10. MAD Skills: New Analysis Practices for Big Data (2009)
11. C-Store: A Column-oriented DBMS(2005)
12. Distributed Aggregation for Data-Parallel Computing: Interfaces and Implementations(Microsoft)
Microsoft 
1. Dryad: Distributed Data-Parallel Programs from Sequential Building Blocks (2007) 
Amazon 
1. Dynamo: Amazon’s Highly Available Key-value Store(2007) 
2. Efficient Reconciliation and Flow Control for Anti-Entropy Protocols 
3. The Eucalyptus Open-source Cloud-computing System 
4. Eucalyptus: An Open-source Infrastructure for Cloud Computing(presentation) 
5. Eucalyptus: A Technical Report on an Elastic Utility Computing Architecture Linking Your Programs to Useful Systems (2008)
6. Zephyr: Live Migration in Shared Nothing Databases for Elastic Cloud Platforms(2011) 
7. Database-Agnostic Transaction Support for Cloud Infrastructures 
8. CloudScale: Elastic Resource Scaling for Multi-Tenant Cloud Systems(2011) 
9. ELT: Efficient Log-based Troubleshooting System for Cloud Computing Infrastructures 
Books 
1. Distributed Systems Concepts and Design (5th Edition) 
2. Principles of Computer Systems (7-11) 
3. Distributed system(chapter) 
4. Data-Intensive Text Processing with MapReduce (2010) 
5. Hadoop in Action 
6. 21 Recipes for Mining Twitter 
7. Hadoop.The.Definitive.Guide.2nd.Edition 
8. Pro hadoop 
Other papers about Distributed system 
1. Flexible Update Propagation for Weakly Consistent Replication(1997) 
2. Providing High Availability Using Lazy Replication(1992) 
3. Managing Update Conflicts in Bayou,a Weakly Connected Replicated Storage System(1995) 
4. XMIDDLE: A Data-Sharing Middleware for Mobile Computing(2002) 
5. Design and Implementation of the Sun Network Filesystem
6. Chord: A Scalable Peer-to-peer Lookup Service for Internet Applications(2001)
7. A Survey and Comparison of Peer-to-Peer Overlay Network Schemes(2004) 
8. Tapestry: An Infrastructure for Fault-tolerant Wide-area Location and Routing(2001) 
BI 
1. 21 Recipes for Mining Twitter(Book) 
2. Web Data Mining(Book) 
3. Web Mining and Social Networking(Book) 
4. Mining the Social Web(Book)
5. TEXTUAL BUSINESS INTELLIGENCE (Inmon) 
6. Social Network Analysis and Mining for Business Applications(yahoo,2011) 
7. Data Mining in Social Networks(2002) 
8. Natural Language Processing with Python(book) 
9. data_mining-10_methods(Chinese edition)
10. Mahout in Action(Book) 
11. Text Mining Infrastructure in R(2008) 
12. Text Mining Handbook(2010) 
Web search engine 
1. Building Efficient Multi-Threaded Search Nodes(Yahoo,2010) 
2. The Anatomy of a Large-Scale Hypertextual Web Search Engine(google) 

Reposted from: https://www.cnblogs.com/jie465831735/archive/2013/03/06/2946982.html
