计算EC2现货实例的运行/累积成本
问题描述:
我经常在EC2上运行现货实例(用于Hadoop任务作业,临时节点等)。其中一些是长期运行的现货实例。计算EC2现货实例的运行/累积成本
它很容易计算按需或保留的EC2实例的成本 - 但是,如何计算作为专有实例运行的特定节点(或多个节点)所产生的成本?
我知道现货实例的成本会根据市场价格每小时变化 - 那么是否有任何方法可以计算正在运行的现货实例的累计总成本?通过API或其他方式?
答
我已经重新编写网速慢的解决方案与boto3工作。确保使用utctime与tz设置!:
def get_spot_instance_pricing(ec2, instance_type, start_time, end_time, zone):
result = ec2.describe_spot_price_history(InstanceTypes=[instance_type], StartTime=start_time, EndTime=end_time, AvailabilityZone=zone)
assert 'NextToken' not in result or result['NextToken'] == ''
total_cost = 0.0
total_seconds = (end_time - start_time).total_seconds()
total_hours = total_seconds/(60*60)
computed_seconds = 0
last_time = end_time
for price in result["SpotPriceHistory"]:
price["SpotPrice"] = float(price["SpotPrice"])
available_seconds = (last_time - price["Timestamp"]).total_seconds()
remaining_seconds = total_seconds - computed_seconds
used_seconds = min(available_seconds, remaining_seconds)
total_cost += (price["SpotPrice"]/(60 * 60)) * used_seconds
computed_seconds += used_seconds
last_time = price["Timestamp"]
# Difference b/w first and last returned times
avg_hourly_cost = total_cost/total_hours
return avg_hourly_cost, total_cost, total_hours
答
好的我在Boto库中找到了一个这样做的方法。这段代码并不完美 - 博托似乎没有返回确切的时间范围,但它确实在一定范围内或多或少地获得了历史现货价格。下面的代码似乎工作得很好。如果任何人都可以改进它,那会很棒。
import boto, datetime, time
# Enter your AWS credentials
aws_key = "YOUR_AWS_KEY"
aws_secret = "YOUR_AWS_SECRET"
# Details of instance & time range you want to find spot prices for
instanceType = 'm1.xlarge'
startTime = '2012-07-01T21:14:45.000Z'
endTime = '2012-07-30T23:14:45.000Z'
aZ = 'us-east-1c'
# Some other variables
maxCost = 0.0
minTime = float("inf")
maxTime = 0.0
totalPrice = 0.0
oldTimee = 0.0
# Connect to EC2
conn = boto.connect_ec2(aws_key, aws_secret)
# Get prices for instance, AZ and time range
prices = conn.get_spot_price_history(instance_type=instanceType,
start_time=startTime, end_time=endTime, availability_zone=aZ)
# Output the prices
print "Historic prices"
for price in prices:
timee = time.mktime(datetime.datetime.strptime(price.timestamp,
"%Y-%m-%dT%H:%M:%S.000Z").timetuple())
print "\t" + price.timestamp + " => " + str(price.price)
# Get max and min time from results
if timee < minTime:
minTime = timee
if timee > maxTime:
maxTime = timee
# Get the max cost
if price.price > maxCost:
maxCost = price.price
# Calculate total price
if not (oldTimee == 0):
totalPrice += (price.price * abs(timee - oldTimee))/3600
oldTimee = timee
# Difference b/w first and last returned times
timeDiff = maxTime - minTime
# Output aggregate, average and max results
print "For: one %s in %s" % (instanceType, aZ)
print "From: %s to %s" % (startTime, endTime)
print "\tTotal cost = $" + str(totalPrice)
print "\tMax hourly cost = $" + str(maxCost)
print "\tAvg hourly cost = $" + str(totalPrice * 3600/ timeDiff)
答
您可以订阅现场实例数据馈送,以获得转储到S3存储桶的正在运行的实例的费用。安装EC2工具集,然后运行:
ec2-create-spot-datafeed-subscription -b bucket-to-dump-in
注意:你只能有一个数据馈给订阅整个帐户。
在你应该开始看到gzip压缩选项卡分隔的文件显示在斗一小时才得知这个样子:
#Version: 1.0
#Fields: Timestamp UsageType Operation InstanceID MyBidID MyMaxPrice MarketPrice Charge Version
2013-05-20 14:21:07 UTC SpotUsage:m1.xlarge RunInstances:S0012 i-1870f27d sir-b398b235 0.219 USD 0.052 USD 0.052 USD 1
答
我最近开发了计算单个EMR的成本小Python库群集或群集列表(给定一段时间)。
它还考虑了竞价型实例和任务节点(可能会在群集仍在运行时上下移动)。
为了计算我使用的出价,其中(很多情况下)可能不是您最终为实例支付的确切价格。 但是,根据您的出价政策,此价格可能足够准确。
您可以在这里找到代码:https://github.com/memosstilvi/emr-cost-calculator