valve发布dota2ti冠军宣传视频 ti预选赛赛况
淘宝搜:【天降红包222】领超级红包,京东搜:【天降红包222】
淘宝互助,淘宝双11微信互助群关注公众号 【淘姐妹】
在近日的团队会议中,Valve概述了明年《DOTA2》职业巡回赛的一些变化。
TI9资格问题
自从Valve为TI开放地区资格赛起,玩家们就一直对每个区域的资格问题争议不断。据知情人士透露,TI9每个地区资格赛将只有一个晋级名额,六个赛区分别是――中国、东南亚、欧洲、独联体、北美、南美。这意味着在下个赛季积分榜前12名的队伍将获得直邀资格。
职业巡回赛积分问题
很多队伍都在抱怨职业巡回赛积分分配问题,目前的比赛只有第一至四名才能获得DPC积分,V社表示或许将增加积分分配,但尚未决定是前六名还是前八名。
俱乐部赞助问题
Valve要求各队伍拒绝接受任何博彩公司或网站的赞助。现在TI8很多队伍都有着博彩网站或公司的赞助。V社也并不是第一个拒绝博彩赞助的开发商。
OpenAI
OpenAI首次在TI7上公亮相就大放异彩。仅仅两周前,OpenAI Five又击败了前/现任职业选手组成的队伍。为了帮助AI背后的开发团队,Valve希望邀请队伍和AI在TI8上展开人机大赛。
What Companies Does Elon Musk Own?
Elon Musk is a polarizing figure in the tech and business worlds, but no matter what anyone thinks of him, he is an innovative disruptor that is tackling a wide set of issues. He may be best know for his electric vehicle company, Tesla, and his ambitious rocket startup, SpaceX, but he has much more going on. Here’s a quick list of the companies that Musk has founded that are disrupting legacy industries and creating new ones.
Musk revealed that he had taken a 9.2% stake in Twitter worth around $2.9 billion in April 2022, making him the social media company’s largest individual shareholder. Musk’s decision to buy a significant stake in Twitter is the latest development in his rocky relationship with the platform. Over the past few years, Musk has been vocal about his concerns that the company is censoring speech and just last month the billionaire asked his Twitter followers, “Given that Twitter serves as the de facto public town square, failing to adhere to free speech principles fundamentally undermines democracy. What should be done?"
Musk’s share of the company is considered a passive investment by Wall Street, meaning Musk has purchased the shares as a long-term investment, it signals that he is looking to take a more active role in the way social media companies are run. Before his big purchase, Musk tweeted asking his more than 80 million followers if they thought Twitter’s algorithm should be open sourced. He also asked if the company “rigorously adheres” to the principle that “free speech is essential to a functioning democracy.” Alongside the latter tweet, Musk told his followers, "The consequences of this poll will be important. Please vote carefully."
In December 2016, Musk founded The Boring Company after growing frustrated with traffic in Los Angeles and the limitations of two-dimensional transportation networks. The startup aims to create subterranean tunnels that allow pedestrians, freight, utilities, or autonomous electric vehicles to circumvent traffic on surface roads and more directly get from A to B.
The company has proposed multiple projects including a tunnel that would carry passengers between Washington DC and Baltimore, Maryland. The company has an initial test tunnel in Hawthorne, California, and a project under construction in Las Vegas to ferry people around the Las Vegas Convention Center campus.
The Boring Company has also made headlines by selling 20,000 flamethrowers to consumers, which made the company $10 million in revenue. The “Not-a-Flamethrower” could have been a publicity stunt or a way to raise more money without taking on debt or diluting ownership.
Neuralink Corporation was started by Musk in July 2016 and is developing an implantable brain-machine interface. The company initially plans to use the tech to help people with paralysis to regain independence through the control of computers and mobile devices. The company wants the devices to give people the ability to communicate via text or speech synthesis, surf the web, or to express their creativity through photography, art, or writing apps.
In August 2020, Musk unveiled a pig called Gertrude with a coin-sized computer chip in its brain as a proof of concept. According to Musk, Neuralink has obtained an FDA breakthrough device designation which allows the company to conduct limited human testing under the FDA guidelines.
OpenAI is an artificial intelligence research laboratory co-founded by Musk in December 2015. Musk resigned from the company’s board in February 2018 but remains one of its top donors.
OpenAI is seeks to build a machine with human intelligence while prioritizing transparency and safety. In June 2020, the lab released its first commercial product. The tool, dubbed “the API,” allows businesses to directly access OpenAI’s powerful general-purpose text generation AI, which has been trained on trillions of words from the internet. The lab had initially been wary of publishing the full version of the text generation AI, as it could be misused.
SpaceX has made headlines for reusable rockets, space tourism, and its network of satellites that can provide internet connectivity on earth. The exciting space company was founded by Musk in 2002 to reduce space transportation costs and eventually enable the colonization of Mars.
While the company was founded with lofty ambitions it has made significant strides in the aerospace industry. The company completed its first reusable rocket launch in 2015 and in August 2020 the company celebrated a record sixth reuse of a Falcon 9 first stage booster. The company’s progress doesn’t stop there as the company’s Crew Dragon spacecraft is slated to complete its first operational mission launches to the International Space Station at the end of October.
Additionally, in early October 2020 Musk announced the company launched its latest batch of connectivity satellites, and tweeted, “Once these satellites reach their target position, we will be able to roll out a fairly wide public beta in northern US & hopefully southern Canada. Other countries to follow as soon as we receive regulatory approval.”
Perhaps the venture Musk is best known for, Tesla is a game-changing electric vehicle maker that can be in-part be credited with popularizing the idea of battery-electric cars. Musk began the automaker in July 2003, and by 2005 the company had revealed the prototype of its electric car, the Roadster, which went into production in 2008. Now the company offers a wide range of electric vehicles - including an SUV, sports car, and sedan - and soon the company will offer an electric semi-truck and pickup truck.
Since its founding, Tesla has expanded beyond its core automotive business into clean energy generation and storage. In 2016, the company acquired SolarCity, another Musk-founded company, which sold solar power systems. Now a subsidiary of Tesla, SolarCity sells its own solar panels and produces panels in the shape of roof shingles to create a more aesthetically appealing and effective solution. Alongside solar panels, Tesla produces battery systems for customers to store their energy.
X.com was an online bank founded by Elon Musk in November 1999. The company quickly merged with software company Confinity and changed its name to PayPal which was bought by eBay in 2001 for $1.5 billion.
In 2017, Musk bought the rights to X.com from PayPal, citing nostalgia. The website temporarily redirected visitors to The Boring Company's website, but now just features a single black “x” in the top left corner of an all-white page.
Sign up now: Stay up to date, level up and hire better with our behind the scenes newsletters at the world’s top startups.
[DQN] What is Deep Reinforcement Learning
已经成为DL中专门的一派,高大上的样子
?
Intro:
MIT 6.S191 Lecture 6: Deep Reinforcement Learning
Course:
CS 294: Deep Reinforcement Learning?
?
Jan 18: Introduction and course overview (Levine, Finn, Schulman)
- Slides: Levine
- Slides: Finn
- Slides: Schulman
- Video
Why deep reinforcement learning? ? Deep = can process complex sensory input ? …and also compute really complex functions ? Reinforcement learning =can choose complex actions
?
OpenAI 2016年6月21日宣布了其主要目标,包括制造“通用”机器人和使用自然语言的聊天机器人。
?
Q-learningV. Mnih, K. Kavukcuoglu, D. Silver, A. Graves, I. Antonoglou, et al. “Playing Atari with Deep Reinforcement Learning”. (2013). policy gradients J. Schulman, S. Levine, P. Moritz, M. I. Jordan, and P. Abbeel. “Trust Region Policy Optimization”. (2015); V. Mnih, A. P. Badia, M. Mirza, A. Graves, T. P. Lillicrap, et al. “Asynchronous methods for deep reinforcement learning”. (2016). DAGGER X. Guo, S. Singh, H. Lee, R. L. Lewis, and X. Wang. “Deep learning for real-time Atari game play using offline Monte-Carlo tree search planning”. NIPS. 2014. guided policy search S. Levine, C. Finn, T. Darrell, and P. Abbeel. “End-to-end training of deep visuomotor policies”. (2015). policy gradients J. Schulman, P. Moritz, S. Levine, M. Jordan, and P. Abbeel. “High-dimensional continuous control using generalized advantage estimation”. (2015).
?
Finally, AlphaGo 的 四大技术
supervised learning + policy gradients + value functions + Monte-Carlo tree search?
?
?
问题:RE有没有deep,能如何?
回答:Google’s DeepMind published its famous paper?Playing Atari with Deep Reinforcement Learning, in which they introduced a new algorithm called?Deep Q Network?(DQN for short) in 2013. It demonstrated how an AI agent can learn to play games by just observing the screen without any prior information about those games(无信息先验?). The result turned out to be pretty impressive.
This paper opened the era of what is called ‘deep reinforcement learning’, a mix of deep learing and reinforcement learning.
?
Then, 通过实践了解这个牛牛的网络:Deep Q Learning with Keras and Gym
外加一个有良心的国内博客:用Tensorflow基于Deep Q Learning DQN 玩Flappy Bird?(课外阅读)
?
CartPole is one of the simplest environments in OpenAI gym (a game simulator).
当然了,有经费,也可以这么搞个真玩意。
?
As you can see in the animation from the top, the goal of CartPole is to balance a pole connected with one joint on top of a moving cart.
Instead of pixel information, there are 4 kinds of information given by the state, such as angle of the pole and position of the cart.
An agent can move the cart by performing a series of actions of 0 or 1 to the cart, pushing it left or right.
Gym makes interacting with the game environment really simple.
next_state, reward, done, info=env.step(action) |
学习的输入参数,要具体问题具体分析。
As we discussed above, action can be either 0 or 1.
If we pass those numbers,?, which represents the game environment, will emit the results.is a boolean value telling whether the game ended or not.
The old?information paired withand??and??is the information we need for training the agent.
?
This post is not about deep learning or neural net. So we will consider neural net as just a black box algorithm.
An algorithm that learns on the pairs of example input and output data, detects some kind of patterns, and predicts the output based on an unseen input data.
But we should understand which part is the neural net in the DQN algorithm.
DQN 算法中哪里涉及神经网络
Note that the neural net we are going to use is similar to the diagram above.
We will have one input layer that receives 4 information and 3 hidden layers. 输入层
But we are going to have 2 nodes in the output layer since there are two buttons (0 and 1) for the game.
Keras makes it really simple to implement basic neural network.
The code below creates an empty neural net model.?
,andare the parameters that define the characteristics of the neural network, but we are not going to discuss it here.
原来如此结合,有机会实现下,目前不着急。
?
?
Ref: http://www.cnblogs.com/Leo_wl/p/5852010.html
?
基于NEAT算法的马里奥AI实现
所谓NEAT算法即通过增强拓扑的进化神经网络(Evolving Neural Networks through Augmenting Topologies),算法不同于我们之前讨论的传统神经网络,
-
- 它不仅会训练和修改网络的权值,
- 同时会修改网络的拓扑结构,包括新增节点和删除节点等操作。
NEAT算法几个核心的概念是:
-
- 基因:网络中的连接
- 基因组:基因的集合
- 物种:一批具有相似性基因组的集合
- Fitness:有点类似于增强学习中的reward函数
- generation:进行一组训练的基因组集合,每一代训练结束后,会根据fitness淘汰基因组,并且通过无性繁殖和有性繁殖来新增新的基因组
- 基因变异:发生在新生成基因组的过程中,可能会出现改变网络的权重,增加突出连接或者神经元,也有可能禁用突触或者启用突触
下图我们展示了算法从最一开始简单的神经网络,一直训练到后期的网络